I 


WADC  TECHNICAL  REPORT  52-204 


A 


.J:Di 

'-4 


I A -r  . 

I-  r ; ;''-4 


Hi /ifh  iUii 


,‘y  f ;■'•/  "‘/'.n 

' ''1  >,J  ■ h ; ^ 

..  ^ ii 

Ubl-  y>f 


> 4ni:^ 


v7 


HANDBOOK  OF  ACOUSTIC  NOISE  CONTROL 
VOLUME  II.  HOISE  and  MAN 


WALTER  A.  ROSEN BLITH 
KENNETH  N.  STEVENS 

AND  THE  STAFF  OF  BOLT,  BERANEK,  AND  NEWMAN 


JUNE  1953 


BRIGHT  AIR  DEVELOPMENT  CENTER 


g,ooiiOi>t\oAA 


Statement  A 

Approved  for  Public  Release 


N0TK3E 


When  Government  drawings,  specifications,  or  other  data  are  used 
for  any  purpose  other  than  in  connection  with  a definitely  related  Govern- 
ment procurement  operation,  the  United  States  Government  thereby  in- 
curs no  responsibility  nor  any  obligation  whatsoever;  and  the  fact  that 
the  Government  may  have  formulated,  furnished,  or  in  any  way  supplied 
the  said  drawings,  specifications,  or  other  data,  is  not  to  be  regarded 
by  implication  or  otherwise  as  in  any  manner  licensing  the  holder  or 
any  other  person  or  corporation,  or  convc3dng  any  ri^ts  or  permission 
to  manufacture,  use,  or  sell  any  patented  invention  that  may  in  any  way 
be  related  thereto. 


WADC  TECHNICAL  REPORT  52-204 


HANBOOK  OF  ACOUSTIC  NOISE  CONTROL 
VOLUME  II.  NOISE  AND  MAN 


Walier  A.  Rosenblith 
Kenneth  N*  Stevens 

and  the  Staff  of  Bolt,  Beranek,  and  Newman 


June  1953 


Aero  Medical  Laboratory 
Contract  No.  AF  33(038)- 20 57 2 
RDO  No.  695-63 


IKright  Air  Development  Center 
Air  Research  and  Development  Command 
United  States  Air  Force 
%rigla*Patter8on  Air  Force  Base,  Ohio 


Carpenter  Lltho  & Prtg.  Co.  > Springfield,  0. 
1000  • 28  Septen^er  195** 


FOREWORD 


This  report  was  prepared  by  the  firm  of  Bolt, 
Beranek,  and  Newman,  Consultants  in  Acoustics, 
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plemental Agreement  No.  1 and  CO  No.  2 for  the 
Wright  Air  Development  Center.  The  authors  of  this 
report,  Messrs.  Walter  A.  Rosenblith  and  Kenneth  N. 
Stevens,  were  assisted  by  the  entire  staff  of  Bolt, 
Beranek,  and  Newman.  The  work  was  conducted  under 
RDO  No.  695-03,  "Vibration,  Sonic  and  Mechanical 
Action  on  Air  Force  Personnel."  Technical  supei^ 
vision  of  the  preparation  of  the  report  was  the 
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States  Air  Force,  Aero  Medical  Laboratory,  Research 
Division,  Wright  Air  Development  Center,  Wright- 
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ABSTRACT 


This  handbook,  comprising  two  volumes.  Is  intended 
to  provide  an  overall  view  of  the  noise  control  problem. 
Classically,  the  word  noise  denotes  a phenomenon  related 
to  hearing,  and  noise  Is  commonly  defined  as  any  unwanted 
sound.  In  recent  years,  noise  has  come  to  mean  the  anti- 
thesls  of  desired  signal  In  any  stimulus  or  form  of  energy. 
Thermal  motions  produce  noise  in  an  electronic  system  and 
ground  reflections  may  constitute  noise  In  a radar  device. 
Thus  It  becomes  desirable  to  designate  explicitly  the 
subject  matter  of  this  handbook,  acoustic  noise . 

There  are  several  ways  In  which  acoustic  noise  can 
be  undesirable;  It  can  produce  pain  and  damage  of  personnel. 
It  can  Interfere  with  speech  communication,  and  it  can  cause 
annoyance  and  general  degradation  of  environment  for  work 
and  relaxation.  These  matters  are  the  subject  of  Volume  II, 
in  which  the  several  subjective  responses  are  analyzed  and 
correlated  with  properties  of  the  physical  stimuli.  Volume 
I Is  concerned  with  the  stimuli  themselves,  with  their 
generation  and  with  their  control . 

However,  the  problems  encountered  in  the  design  of 
a noise  control  Installation  are  rarely  of  a purely  physical 
or  biological  nature,  but  include  the  varied  factors  of 
economics,  operations  and  planning.  Thus  the  acoustical 
engineer  must  be  capable  of  effecting  a compromise  between 
these  often  contradictory  considerations.  The  present 
volume,  therefore.  Is  a guide  to  assist  In  making  a 
rational  approach  to  the  problems  of  noise  control. 

PUBLICATION . REVIEW 

This  report  has  been  reviewed  and  Is  approved. 

FOR  THE  COMMANDER: 


Colonel,  USAF  (MC) 

Chief,  Aero  Medical  Laboratory 
Directorate  of  Research 
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CHAPTER  15 


BIO-ACOUSTICS:  TERMINOLOGY  AND  LITERATURE 


15.1  Definitions  and  Terminology 

We  Include  here  a number  of  definitions  of  terms  that 
are  used  either  In  Volume  II  of  this  Handbook  or  In  the  litera- 
ture of  the  field.  Most  of  the  terms  are  related  directly  to 
the  areas  of  speech  and  hearing.  There  are,  however,  several 
more  general  definitions,  some  of  which  are  repeated  from 
Chapter  2. 

A majority  of  the  definitions  have  been  drawn  from  the 
American  Standard  Acoustical  Terminology  (1951).  These  defini- 
tions will  be  marked  with  an  asterisk  to  distinguish  them  from 
those  that  come  from  other  sources  or  have  been  compiled 
especially  for  this  Handbook. 

The  definitions  of  Section  5 (Hearing  and  Speech)  of  the 
American  Standard  Acoustical  Terminology  aim  essentially  at 
the  psychophysics  of  pure  tones  and  of  articulation  testing. 
Those  readers  who  are  concerned  mainly  with  the  effects  of 
noise  of  various  types  upon  human  behavior  will  find  that  the 
Standard  Terminology  does  not  always  do  Justice  to  the  area  of 
their  primary  Interest.  It  Is,  of  course,  hardly  to  be  expected 
that  concepts  standardized  for  stimulus  Intensities  In  the  com- 
fortable listening  range  and  for  laboratory  situations  should 
apply  without  modification  to  the  variegated  problems  Involving 
the  Interaction  of  noise  and  man. 

*Alr  Conduction.  Air  conduction  Is  the  process  by  which 
sound  Is  conducted  to  the  Inner  ear  through  the  air  In  the 
outer  ear  canal  as  part  of  the  pathway. 

♦Articulation  (Percent  Articulation)  and  Intelligibility  (Per- 
cent Intelligibility)"  Percent  articulation  or  percent  In- 
telllglblllty  of  a communication  system  Is  the  percentage  of 
the  speech  units  spoken  by  a talker  or  talkers  that  Is  under- 
stood correctly  by  a listener  or  listeners. 

The  word  "articulation"  Is  customarily  used  when  the  con- 
textual relations  among  the  units  of  the  speech  material  are 
thought  to  play  an  unimportant  role;  the  word  "intelligibility" 
Is  customarily  used  when  the  context  Is  thought  to  play  an 
Important  role  In  determining  the  listener's  perception. 
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Note  1 ; It  is  important  to  specify  the  type  of  speech 
material  and  the  units  into  which  it  is  analyzed  for  the  purpose 
of  computing  the  percentage . The  units  may  be  fundamental 
speech  sounds,  syllables,  words,  sentences,  etc. 

Note  2:  The  percent  articulation  or  percent  Intelligi- 

bility is  a property  of  the  entire  communication  system: 
talker,  transmission  equipment  or  medium,  and  listener.  Even 
when  attention  is  focused  upon  one  component  of  the  system 
(e.g.,  a talker,  a radio  receiver),  the  other  components  of 
the  system  should  be  specified. 

j Articulation  Score . See  articulation. 

Articulation  Index.  See  Sec.  16.4. 

♦Artificial  Ear.  An  artificial  ear  is  a device  for  the 
measurement  of  earphones  which  present  an  acoustic  Impedance 
to  the  earphone  equivalent  to  the  Impedance  presented  by  the 
average  human  ear.  It  is  equipped  with  a microphone  for  mea- 
surement of  the  sound  pressures  developed  by  the  earphone . 

♦Audiogram  (Threshold  Audiogram) . An  audiogram  is  a graph 
showing  hearing  loss,  percent  hearing  loss,  or  percent  hearing 
as  a function  of  frequency. 

Audiology.  Audiology  (literally  the  "science  of  hearing")  is 
a field  of  professional  specialization  that  is  primarily  con- 
cerned with  the  treatment  and  measurement  of  impaired  hearing . 

♦Audiometer.  An  audiometer  is  an  Instrument  for  measuring 
hearing  acuity.  Measurements  may  be  made  with  speech  signals, 
usually  recorded,  or  with  tone  signals. 

Note  I Specifications  for  a pure  tone  audiometer  for 
general  diagnostic  purposes  are  covered  by  American  Standard 
Specifications  for  Audiometers  for  General  Diagnostic  Purposes, 
Z 24.5  - 1951 j or  the  latest  revision  thereof  approved  by  the 
American  Standards  Association,  Inc . 

♦Auditory  Sensation  Area. 

(a)  Auditory  sensation  area  is  the  region  enclosed  by 
the  curves  defining  the  threshold  of  feeling  and  the  threshold 
of  audibility  as  functions  of  frequency. 
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(b)  Auditory  sensation  area  is  the  part  of  the  brain 
(temporal  lobe  of  the  cortex)  which  is  responsive  to  auditory 
stimuli . 

Binaural . Binaural  listening  is  listening  with  two  ears. 
Binaural  listening  may  be  either  diotlc  (same  stimulus  to  both 
ears)  or  dlchotic  (different  stimuli  to  the  two  ears). 

Bio-Acoustics . Bio-Acoustics  is  the  field  that  deals,  in  the 
broadest  sense,  with  the  interaction  of  sound  fields  or  me- 
chanical vibratory  phenomena  with  organisms. 

*Bone  Conduction.  Bone  conduction  is  the  process  by  which 
sound  is  conducted  to  the  inner  ear  through  the  cranial  bones . 

Composite  Noise  Rating.  The  composite  noise  rating  rank  orders 
noise  stimuli  in  residential  areas  in  order  to  predict  the 
probable  reactions  they  produce  among  the  residents  (see  Sec. 
18.2) . 

Coupler . See  earphone  coupler. 

Critical  Bandwidth.  The  critical  bandwidth  is  the  width  of 
a band  of  noise  (o^f  the  continuous  type  of  spectrum)  whose 
energy  is  equal  to  that  of  a given  pure  tone  at  its  masked 
threshold.  The  frequency  of  the  pure  tone  is  equal  to  the 
center  frequency  of  the  band  of  noise . 

Damage  Risk  (PR)  Criterion.  A damage  risk  (DR)  criterion 
specifies  the  maximum  sound  pressure  levels  of  a noise  as  a 
function  of  frequency  to  which  people  should  be  exposed  if 
risk  of  hearing  loss  is  to  be  avoided.  A DR  criterion  should 
Include  in  its  statement  a specification  of  such  factors  as 
time  of  exposure,  amount  of  hearing  loss  that  is  considered 
significant,  percentage  of  the  population  to  be  protected,  and 
method  of  measuring  the  noise  (see  Sec.  l8.4). 

♦Difference  Llmen  (Differential  Threshold)  (just  Noticeable 
Differences ) . A difference  Ilmen  is  the  Increment  in  a 
stimulus  which  is  Just  noticed  in  a specified  fraction  of  the 
trials . The  relative  difference  Ilmen  is  the  ratio  of  the 
difference  limen  to  the  absolute  magnitude  of  the  stimulus 
to  which  it  is  related. 

Dlplacusis  (Binaural).  An  observer  exhibits  the  phenomenon 
of  binaural  dlplacusis  when  two  tones  of  the  same  frequency, 
each  presented  to  a different  ear,  appear  to  differ  in  pitch. 


WADC  TR  52-204 


3 


Discrimination  Loss.  A discrimination  loss  Is  given  as  the 
difference  between  100  per  cent  and  the  percentage  of  words 
of  a PB  (phonetically  balanced)  list  that  a listener  repeats 
correctly.  The  PB  list  must  be  presented  at  an  Intensity 
such  that  a further  Increase  In  Intensity  will  not  yield  a 
higher  articulation  score. 

♦Earphone  Coupler.  An  earphone  coupler  Is  a cavity  of  pre- 
determined shape  which  Is  used  for  the  testing  of  earphones. 

It  Is  provided  with  a microphone  for  the  measurement  of  pres- 
sures developed  In  the  cavity. 

Note  1 ; .Couplers  generally  have  a volume  of  6 cubic 
centimeters  for  testing  regular  earphones  and  a volume  of 
2 cubic  centimeters  for  testing  Insert  earphones. 

Note  2;  Specifications  for  couplers  are  given  in  the 
Proposed  American  Standard  Method  for  the  Coupler  Calibration 
of  Earphones,  Z 24.9  - 19^9^  or  the  latest  revision  thereof 
approved  by  the  American  Standards  Association,  Incorporated. 

♦Instantaneous  Speech  Power.  The  Instantaneous  speech  power 
Is  the  rate  at  which  sound  energy  Is  being  radiated  by  a speech 
source  at  any  given  Instant . 

♦Hearing  Loss  (HL)  (Deafness) . The  hearing  loss  of  an  ear  at 
a specified  frequency  Is  the  ratio,  expressed  In  decibels,  of 
the  threshold  of  audibility  for  that  ear  to  the  normal  thres- 
hold. (See  American  Standard  Specification  for  Audiometers 
for  General  Diagnostic  Purposes,  Z 24. 9-1951 ^ or  the  latest 
revision  thereof  approved  by  the  American  Standards  Association, 
Incorporated,  see  also  Sec.  16.3. 

♦Hearing  Loss  for  Speech.  Hearing  loss  for  speech  Is  the 
difference  In  decibels’  between  the  speech  levels  at  which  the  • 
average  normal  e.ap' and  the  defective  ear,  respectively,  reach 
the  same  Intelligibility,  often  arbitrarily  set  at  50  percent. 

♦Level  above  Threshold  (Sensation  Level).  The  level  above  thres 
hold  of  a sound  Is  the  pressure  level  of  the  sound  In  decibels 
above  its  threshold  of  audibility  for  the  Individual  observer. 

Level  Rank.  The  level  rank  is  a means  of  ordering  noise 
stimuli  In  residential  areas  on  the  basis  of  their  octave  band 
levels.  When  certain  correction  numbers  are  applied  to  the 
level  rank,  the  composite  noise  rating  Is  obtained.  (See 
Sec.  l8.2). 
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♦Loudness.  Loudness  Is  the  Intensive  attribute  of  an 
auditory^ sensation,  in  terras  of  which  sounds  may  be  ordered 
on  a scale  extending  from  soft  to  loud. 

Note:  Loudness  depends  primarily  upon  the  sound  pressure 
of  the  stimulus,  but  it  also  depends  upon  the  frequency  and 
wave  form  of  the  stimulus. 

♦Loudness  Contours.  Loudness  contours  are  curves  which  show 
the  related  values  of  sound  pressure  level  and  frequency 
required  to  produce  a given  loudness  sensation  for  a t3q)lcal 
listener. 

*Loudness  Level . The  loudness  level,  in  phons,  of  a sound  is 
numerically  equal  to  the  sound  pressure  level  in  decibels, 
relative  to  0.0002  mlcrobar  (dyne/cm2),  of  a simple  tone  of 
frequency  1000  cps  which  is  Judged  by  the  listeners  to  be 
equivalent  in  loudness . 

♦Masking . Masking  is  the  amount  by  which  the  threshold  of 
audibility  of  a sound  is  raised  by  the  presence  of  another 
(masking)  sound.  The  unit  customarily  used  is  the  decibel. 

♦Masking  Audiogram.  A masking  audiogram  is  a graphical  presenta- 
tion of  the  masking  due  to  a stated  noise . This  is  plotted,  in 
decibels,  as  a function  of  the  frequency  of  the  masked  tone. 

♦Mel . The  mel  is  a unit  of  pitch.  By  definition,  a simple 
tone  of  1000  cps  frequency,  40  decibels  above  a listener's 
threshold,  produces  a pitch  of  1000  mels.  The  pitch  of  any 

sound  that  is  judged  by  the  listener  to  be  n times  that  of  a . 

1-mel  tone  is  n mels.  ' 

Minimum  Audibel  Field  (MAF).  The  minimum  audible  field  is  the  ‘ 
threshold  of  audibility  for  the  case  where  the  sound  pressure 
in  a plane  progressive  wave  is  measured  at  the  position  of  the 

center  of  a listener ' s head  before  the  head  is  Inserted  into 

the  sound  field. 

Minimum  Audible  Pressure  (MAP) . The  minimum  audible  pressure 
is  the  threshold  of  audibility  for  the  case  where  the  sound 
pressure  is  measured  at  the  eardrum  of  the  listener. 

Monaural . Monaural  listening  is  listening  with  one  ear. 
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*Nol3e . Noise  is  any  undesired  sound.  By  extension,  noise 
is  any  unwanted  disturbance  within  a useful  frequency  band, 
such  as  undesired  electric  waves  in  any  transmission  channel 
or  device . 

Noise  Rating.  See  composite  noise  rating. 

Otology.  Otology  is  that  branch  of  medicine  that  is  concerned 
with  diseases  of  the  ear. 

Peak  Clipping.  Peak  clipping  is  a type  of  amplitude  distor- 
tion. A peak-clipping  or  peak-limiting  circuit  does  not  pass 
peaks  exceeding  some  fixed  value.  (See  Sec.  l6.4). 

*Peak  Speech  Power.  The  peak  speech  power  is  the  maximum  value 
of  the  Instantaneous  speech  power  within  the  time  interval 
considered . 

♦Percent  Hearing  Loss  (Percent  deafness).  The  percent  hearing 
loss  at  a given  frequency  is  100  times  the  ratio  of  the  hear- 
ing loss  in  decibels  to  the  number  of  decibels  between  the 
normal  threshold  levels  of  audibility  and  feeling. 

Note  1;  A weighted  mean  of  the  percent  hearing  losses 
at  specified  frequencies  is  often  used  as  a single  measure  of 
the  loss  of  hearing . 

Note  2:  The  American  Medical  Association  has  defined 

percentage  loss  of  hearing  for  medicolegal  use . (See  the 
Journal  of  the  American  Medical  Association,  133^  396-397). 

*Phon . The  phon  is  the  unit  of  loudness  level  as  specified 
in  the  previous  definitions . 

Phonetically  Balanced  (PB) . A PB  list  is  a list  of  monosyllabic 
words  that  contains  a distribution  of  speech  sounds  that  approxi- 
mate the  distribution  of  the  same  sounds  as  they  occur  in  con- 
versational English. 

♦Pitch.  Pitch  is  that  attribute  of  auditory  sensation  in  terms 
of  which  sounds  may  be  ordered  on  a scale  extending  from  low 
to  high,  such  as  a musical  scale. 

Note  1;  Pitch  depends  primarily  upon  the  frequency  of 
the  sound  stimulus,  but  it  also  depends  upon  the  sound  pres- 
sure and  wave  form  of  the  stimulus . 
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Note  2;  The  pitch  of  a sound  may  be  described  by  the 
frequency  of  that  simple  tone,  having  a specified  sound  pres- 
sure or  loudness  level,  which  seems  to  the  average  normal 
ear  to  produce  the  same  pitch. 

Power  Level . Power  level  is  the  ratio,  expressed  in  decibels, 
of  the  total  acoustic  power  W radiated  by  a sound  source  to  a 
reference  power  • Thus  : 

PWL  = 10  log^Q  (W/P^gj.). 

-1^ 

Usually,  Is  taken  as  1.0  x 10  watt. 

Presbycousis . Presbycousls  is  the  condition  of  hearing  loss 
specifically  associated  with  old  age. 

Psycho-Acoustics . Psycho-acoustics  is  that  branch  of  psycho- 
physics in  which  observers  are  asked  to  make  Judgments  when 
acoustic  stimuli  are  presented  to  them. 

Random  Noise . Random  noise  is  a sound  or  electrical  wave 
whose  instantaneous  amplitudes  as  a function  of  time  occur, 
according  to  a normal  (Gaussian)  distribution  curve. 

Recruitment . Recruitment  is  a phenomenon  associated  with 
certain  types  of  hearing  loss  in  which  the  loudness  of  tones 
appears  to  Increase  with  intensity  more  rapidly  than  normal . 

Sensation  Level . See  level  above  threshold. 

Signal-to-Noise  Ratio.  Signal -t,o -noise  ratio  is  the  ratio, 
usually  expressed  in  decibels,  of  the  intensity  of  a signal 
to  the  Intensity  of  a noise  that  masks  or  partially  masks  the 
signal . 

Social  Adequacy  Index  (SAI) . The  Social  Adequacy  Index  is  a 
number  in  the  range  1 to  100  that  attempts  to  evaluate 
communicative  adequacy  or  social  adequacy  for  hearing . It  can 
be  determined  from  the  hearing  loss  for  speech  and  the  discrimina- 
tion loss  . 

*Sone . The  sone  is  a unit  of  loudness.  By  definition,  a 
simple  tone  of  frequency  1000  cps,  40  decibels  above  a 
listener's  threshold,  produces  a loudness  of  one  sone.  The 
loudness  of  any  sound  that  is  judged  by  the  listener  to  be  n 
times  that  of  the  one  sone  tone  is  n sones . 

Note  1 ; A milllsone  is  equal  to  0.001  sone. 
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Note  2;  The  loudness  scale  is  a relation  between  loud- 
ness and  level  above  threshold  for  a particular  listener.  In 
presenting  data  relating  loudness  in  sones  to  sound  pressure 
level,  or  in  averaging  the  loudness  scales  of  several  listeners, 
the  thresholds  (measured  or  assumed)  should  be  specified. 

Note  3?  The  term  "loudness  unit"  has  been  used  for  the 
basic  subdivision  of  a loudness  scale  ba-sed  on  group  Judgment 
on  which  a loudness  level  of  4o  phons  has  a loudness  of 
approximately  1000  loudness  units.  For  example,  see  Pig.  1 
of  the  American  Standard  for  Noise  Measurement. 

♦Sound -Level  Meter.  A sound-level  meter  is  an  instrument 
including  a microphone,  an  amplifier,  an  output  meter,  and 
frequency  weighting  networks  for  the  measurement  of  noise  and 
sound  levels  in  a specified  manner;  the  measurements  are  in- 
tended to  approximate  the  loudness  level  which  would  be 
obtained  by  the  more  elaborate  ear  balance  method. 

Note ; Specifications  for  sound-level  meters  for 
measurement  of  noise  and  other  sounds  are  given  in  the  American 
Standard  Sound  Level  Meters  for  Measurement  of  Noise  and  Other 
Sounds,  Z 24.3  - or  the  latest  revision  thereof  approved 

by  the  American  Standards  Association,  Incorporated. 

♦Sound  Pressure  Level  (SPL) . The  sound  pressure  level,  in 
decibels,  of  a sound  is  20  times  the  logarithm  to  the  base  10 
of  the  ratio  of  the  pressure  of  this  sound  to  the  reference 
pressure.  The  reference  pressure  shall  be  explicitly  stated. 

Note  1 : The  following  reference  pressures  are  in  common 

use : 


-4  2 

(a)  2 X 10  mlcrobar  (dyne/cm  ) . 

(b)  1 mlcrobar. 

Reference  pressure  (a)  has  been  in  general  use  for  measure- 
ments dealing  with  hearing  and  sound-level  measurements  in  air 
and  liquids,  while  (b)  has  gained  widespread  use  for  calibra- 
tions and  many  types  of  sound-level  measurements  in  liquids. 

Note  2;  It  is  to  be  noted  that  in  many  sound  fields  the 

sound  pressure  ratios  are  not  proportional  to  the  square  root 
of  corresponding  power  ratios  and  hence  cannot  be  expressed 
in  decibels  in  the  strict  sense;  however,  it  is  common  practice 
to  extend  the  use  of  the  decibels  to  these  cases. 
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Spectrum  Level . The  spectrum  level  of  a noise  at  a specified 
frequency  is  the  sound  pressure  level  within  a band  one  cps 
wide  centered  at  the  frequency. 

Speech  Communication  (SC)  Crlteri-on.  A speech  communication 
criterion  specifies  the  sound  pressure  level  and  spectrum  of 
a masking  noise  necessary  to  guarantee  a specified  quality  of 
speech  communication. 

Speech  Interference  Level  (SIL) . The  speech  Interference 
level  of  a noise  is  the  average,  in  decibels,  of  the  sound 
pressure  levels  of  the  noise  in  the  three  octave  bands  of 
frequency  600-1200,  1200-2400  and  2400-4800  cps. 

Spondees . A spondee  is  a two-syllable  word  with  equal  stress 
on  both  syllables. 

^Threshold  of  Audibility  (Threshold  of  Detectability) . The 
threshold  of  audibility  for  a specified  signal  is  the  minimum 
effective  sound  pressure  of  the  signal  that  is  capable  of 
evolving  an  auditory  sensation  in  a specified  fraction  of  the 
trials.  The  characteristics  of  the  signal,  the  manner  in 
which  it  is  presented  to  the  listener,  and  the  point  at  which 
the  sound  pressure  is  measured  must  be  specified. 

Note  1 : Unless  otherwise  indicated,  the  ambient  noise 

reaching  the  ears  is  assumed  to  be  negligible . 

Note  2:  The  threshold  may  be  expressed  in  decibels 

relative  to  0.0002  microbar  or  to  1 mlcrobar. 

Note  3«  Instead  of  the  method  of  constant  stimuli,  which 
is  implied  by  the  phrase  " a specified  fraction  of  the  trials”, 
another  psychophysical  method  (which  should  be  specified)  may 
be  employed. 

*Threshold  of  Peeling  (or  Discomfort,  Tickle,  Pain)  (dr 
Tolerance  Threshold).  The  threshold  of  feeling  (or  discomfort, 
tickle,  or  pain)  for  a specified  signal  is  the  minimum  effec- 
tive sound  pressure  of  that  signal  which,  in  a specified  frac- 
tion of  the  trials,  will  stimulate  the  ear  to  a point  at  which 
there  is  the  sensation  of  feeling  (or  discomfort,  tickle,  pain). 

Note  1;  Characteristics  of  the  signal  and  the  measuring 
technique  must  be  specified  in  every  case. 
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Note  2;  This  threshold  is  customarily  expressed  in 
decibels  relative  to  0.0002  microbar  or  1 mlcrobar. 

Threshold  of  Detectability  (for  speech) . The  threshold  of 
detectability  is  the  speech  level  at  which  the  listener  is 
Just  able  to  detect  the  presence  of  speech  sounds  about  half 
the  time  without  being  able  to  Identify  any  of  the  individual 
sounds  precisely. 

Threshold  of  Intelligibility  (for  speech) . The  threshold  of 
Intelligibility  is  the  speech  level  at  which  the  listener  is 
Just  able  to  obtain  without  perceptible  effort  the  meaning  of 
almost  every  sentence  and  phrase  of  the  connected  discourse. 

Threshold  of  Perceptibility  (for  speech) . The  threshold  of 
perceptibility  is  the  faintest  speech  level  at  which  the  mean- 
ing of  connected  discourse  can  be  followed  by  trained  listeners 
exerting  attentive  effort . 

Tinnitus . When  an  observer  reports  that  he  hears  sounds  in  the 
absence  of  any  external  acoustic  stimulation,  he  is  said  to 
have  tinnitus.  (Sometimes  called  "head  noises"  or  "ringing  in 
the  ears".) 

♦Ultrasonics . Ultrasonics  is  the  general  subject  of  sound  in 
the  frequency  range  above  about  15  kilocycles  per  second. 

White  Noise . White  noise  is  a sound  or  electrical  wave  whose 
spectrum  is  continuous  and  uniform  as  a function  of  frequency. 
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CHAPTER  16 


BASIC  BIO-  AND  PSYCHO -ACOUSTIC  DATA 


l6 . 1 Introduction 

The  next  few  chapters  deal  with  man's  behavior  In 
response  to  sound  and  with  the  effects  of  Intense  sound  upon 
man.  To  the  engineer  these  problems  may  not  appear  to  be  of 
great  interest.  He  may  feel  that  there  Is  little  he  can  do 
about  such  matters.  "You  can't  deal  with  the  human  factor," 
or,  "that's  human  nature,"  he  may  say.  Such  statements  are 
only  too  often  made  just  before  throwing  up  one's  hands  when 
confronted  with  a problem  Involving  the  Interaction  of  man 
and  his  environment.  In  the  pages  that  follow,  we  shall 
attempt  to  describe  certain  general  features  of  man's  behavior 
In  response  to  sound. 

While  we  are  not  trying  to  give  the  Impression  that  we 
can  predict  the  reactions  of  a particular  person  when  In  the 
presence  of  certain  sounds,  we  can  make  useful  statements  con- 
cerning the  probability  with  which  we  can  expect  to  observe 
certain  reactions  to  sounds  among  a large  number  of  Individuals. 
We  shall  In  most  cases  be  able  to  state  the  range  over  which 
behavior  in  response  to  sound  can  be  observed  and  we  may  even 
be  able  to  give  a fairly  precise  description  of  the  way  In  which 
the  average  man  would  react  in  response  to  a great  number  of 
noises  or  sounds  that  are  actually  encountered.  Even  though  no 
single  customer  In  the  United  States  buys  the  amount  of  goods 
that,  according  to  statistics,  Mr.  Average  Citizen  and  his 
family  will  acquire  In  the  course  of  a year,  most  manufacturers 
and  department  store  managers  are  still  very  happy  to  have  the 
statistical  Information.  Such  Information  permits  them,  and 
anyone  who  deals  with  large  numbers  of  people,  to  take  measures 
which  on  the  average  and  In  the  long  run  will  prove  effective 
and  profitable . 

This  Is  the  principle  on  which  Insurance  companies  operate 
and  pay  dividends;  they  do  not  know  when  any  one  of  their  clients 
Is  going  to  die,  but  they  are  satisfied  If  they  can  predict  how 
many  among  the  large  number  of  their  clients  will  die  during  any 
period.  In  other  words,  anyone  who  deals  with  reasonably  large 
numbers  of  people  (or  mass  production  articles,  for  that  matter) 
may  often  be  satisfied  to  know  how  they  will  behave  on  the 
average . 
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It  is  true  that  in  some  cases  we  might  want  to  know  how 
a particular  individual  is  going  to  be  affected  by  noise.  We 
may  be  able  to  get  the  information  by  further  experimentation 
upon  the  individual  in  question  or  we  may  have  to  be  satisfied 
in  sharing  the  frustration  of  the  physicist,  who  must  remain 
forever  ignorant  of  what  happens  to  a single  electron  or 
molecule . 


l6.2  Responses  of  Man  to  Simple  Acoustic  Stimuli 

Introduction.  People  listen  to  speech,  music,  the  high 
whine  of  mosquitoes,  the  low  rumbling  of  thunder.  They  also 
hear  various  noises  generated  by  industry,  aircraft,  PT  boats, 
subways,  and  other  sources  of  often  unwanted  sounds.  How  do 
we  know  that  one  of  our  fellow  citizens  hears  a given  sound? 

We  can  obviously  ask  him  and  he  can  tell  us;  that  is,  in  most 
cases  he  can  tell  us.  'Hearing'  is  thus  linked  with  or  even 
defined  by  the  occurrence  of  a verbal  response.  There  arise, 
however,  certain  situations,  as  in. the  case  of  Infants, 
auilmals,  or  adults  who  for  some  reason  cannot  talk,  in  which 
we  need  a more  general  definition  of  hearing.  We  say  that  a 
sound  is  heard  by  an  organism  if  the  organism  reacts  or  re 
spends  to  it.  Assume,  for  example,  that  we  have  control 
over  a sound  stimulus.  We  will  say  that  a person  hears  this 
particular  sound  if,  whenever  we  turn  the  sound  on,  our  subject 
responds  in  a significant  number  of  presentations  of  the 
stimulus.  The  subject  may  say  "I  hear  it"  or  lift  his  finger, 
or  if  he  is  an  animal,  move  toward  a previously  arranged 
location. 

The  problem  of  determining  the  effects  of  noise  on  man 
Involves  several  scientific  disciplines.  We  need  physicists 
and  acoustical  engineers  who  are  able  to  measure  the  energy, 
spectral  composition,  and  temporal  characteristics  of  noises 
and  of  stimuli  that  might  be  Involved  in  hearing  tests. 
Psychologists  will  tell  us  about  the  various  methods  by  means 
of  which  we  can  determine  and  measure  the  reactions  of  people 
and  animals  to  sound.  Otologists  can  give  people  medical 
examinations  of  their  ears.  Meanwhile,  physiologists  will 
study  the  effects  of  sounds  upon  animal  and  human  ears  and 
those  parts  of  the  nervous  system  that  function  in  connection 
with  the  ear.  Prom  research  on  animals  we  can  learn  which  part 
of  the  organism  is  most  critically  affected  by  what  kind  of 
noise  and  also  what  kind  of  protective  measures  we  might 
possibly  adopt.  The  industrial  hydlenlst,  the  audiologist. 
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and  the  safety  engineer  have  their  role  to  play:  they  can 

spot  trouble  with  respect  to  both  noise  sources  and  onset  of 
hearing  loss.  They  can  also  assist  in  the  adoption  of  various 
noise  control  schemes  designed  by  acousticians  and  perhaps,  in 
given  situations  supervise,  in  cooperation  with  medical  per- 
sonnel, the  wearing  of  protective  devices  for  the  ear. 

People  often  ask  the  question,  "How  do  we  hear?"  Many 
textbook  writers  on  acoustic  matters  tell  us  that  "the  ear 
can  do  such  and  such."  Such  a statement  usually  Involves 
certain  oversimplifications . We  do  not  know  how  we  hear  any 
more  than  we  know  how  we  see  or  how  we  think.  All  we  can  do 
is  describe  the  reactions  that  people  and  animals  exhibit  to 
certain  classes  of  sound.  We  can  also  measure  certain  me- 
chanical and  electrical  events  that  occur  Inside  the  ear  or 
in  the  auditory  nervous  system  when  certain  sound  stimuli  are 
presented.  We  can  go  further  by  removing  certain  parts  of 
an  animal's  anatomy  and  then  observe  the  effects  of  such 
removals  upon  the  behavior  of  the  organism  as  a whole  or  upon 
the  above-mentioned  mechanical  and  electrical  events . Since 
our  definition  of  hearing  involves  a reaction  or  response  from 
an  entire  organism,  we  should  be  careful  not  to  attribute  to 
the  ear  alone  the  behavior  of  the  animal  or  man  in  response  to 
auditory  stimuli. 

On  one  hand  certain  researchers  are  now  making  very  pain- 
staking measurements  on  the  electrical  and  mechanical  events 
that  occur  in  the  ear  when  a given  stimulus  is  presented.  On 
the  other  hand,  it  has  been  conclusively  shown  that  the  removal 
of  certain  parts  of  the  auditory  nervous  system  abolishes  the 
response  to  some  stimuli  even  if  the  animal's  ear  as  such  re- 
mains intact . Just  how  much  of  the  ear  and  how  much  of  the 
auditory  nervous  system  are  Involved  in  certain  responses  from 
animals  or  humans  to  sound  is  at  the  present  time  largely 
unknown.  We  shall  later  give  a brief  description  of  the  most 
important  parts  of  the  ear  and  of  the  nervous  system  as  they 
function  in  response  to  acoustic  stimuli  (see  Appendix  l). 

While  we  know  little  about  how  people  hear,  we  can  say 
a good  deal  about  the  kinds  of  sounds  people  hear  or  to  which 
they  react.  During  the  last  two  decades  sufficiently  precise 
electroacoustic  equipment  has  become  available  that  we  can 
now  produce  all  sorts  of  stimuli  and  measure  them  with  reason- 
able accuracy.  Yet  most  of  our  data  have  been  collected  on 
pure  tone  stimuli,  l.e.,  sounds  that  correspond  to  sinusoidal 
variations  in  pressure  and  whose  duration  is  very  long  compared 
to  the  period  of  a single  sinusoid. 
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Now  there  Is  nothing  wrong  with  the  use  of  pure  tone 
stimuli  provided  they  are  reasonably*  pure.  However,  we 
must  not  conclude  that  If  the  law  of  superposition  holds  for 
stimuli.  It  holds  necessarily  for  responses.  The  sounds  we 
hear  In  every  day  life*  are  certainly  not  pure  tones . Neither 
speech  sounds,  nor  the  notes  from  musical  Instruments,  nor  the 
sounds  made  by  the  breaking  of  a twig  or  by  Jet  engines  are 
simple  sinusoids.  Most  of  the  sounds  we  encounter  In  our  lives 
are  emitted  by  sources  that  have  not  Just  single  resonances . 
Human  speech,  for  example.  Is  characterized  by  the  multiple 
resonances  of  the  vocal  cavities.  These  cavities  may  be  ex- 
cited by  a periodic  source,  producing  an  output  of  high 
harmonic  content,  or  by  a sound  source  which  originates  from 
the  turbulent  flow  of  air.  The  same  kind  of  analysis  can  be 
applied  to  a good  many  other  natural  sounds . 


We  use  the  word  "reasonably, " otherwise  we  would 
have  to  sound  our  pure  tones  from  minus  Infinity 
to  plus  Infinity  since  no  short  tone  Is  ever  pure 
and  no  pure  tone  Is  ever  short . 


Figure  l6.1 

Several  determinations  of  the  absolute  threshold 
for  pure  tones  are  shown.  Curves  1 and  2 repre- 
sent the  determinations  of  minimum  audible  pres- 
sure (map)  and  minimum  audible  field  (MAF)  of 
Slvlan  and  White  1^/.  Bekesy's  2,3/  measurements 
of  MAP  In  the  low  and  middle  frequency  range  are 
shown  as  Curve  3.  The  MAP  data  of  Waetzmann  and 
Kelbs  4/  are  given  as  Curve  4.  The  results  of 
three  audlometrlc  surveys  are  given  as  a set  of 
three  curves  marked  5.  The  solid  line  represents 
the  median  values  for  listeners  20  to  29  years  of 
age  who  took  the  hearing  tests  at  the  Bell  Tele- 
phone Company's  exhibits  at  the  New  York  and  San 
Francisco  World's  Fairs  In  1938  and  1939  5/.  The 
dashed  and  dotted  curves  were  obtained,  respec- 
tively, by  the  Bell  Telephone  Laboratories  6/  and 
by  the  United  States  Public  Health  Service  7/. 

The  American  Standards  Association  8/  has  adopted 
a "standard"  MAF  curve  (Curve  6),  which  crosses 
the  1000  cps  ordinate  at  exactly  0 db  SPL,  l.e., 
at  a sound  pressure  of  0.0002  dyne/cm^.  (After 
J.  C.  R.  Llckllder  9/). 
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WAETZMANN  a KEIBS  MAP 


What  Is  so  complicated  about  this,  ask  those  familiar 
with  Fourier  analysis.  All  we  have  to  do  is  to  determine  the 
spectrum  of  the  stimulus  and  then  we  can  predict  what  the 
response  to  complex  sounds  is  going  to  be.  This  is  precisely 
the  point  where  they  may  go  wrong . The  kind  of  behavior  we 
observe  in  response  to  complex  acoustic  stimuli  is  only  rarely 
predictable  from  the  knowledge  of  the  response  to  the  various 
Isolated  components.  The  listener  behaves  very  much  like  a 
non-linear  device . We  know  that  over  Timlted  ranges  we  can 
often  replace  non-linear  systems  by  approximately  linear  ones. 
However,  the  characteristics  of  organisms  are  not  only  non- 
linear but  also  time-varying.  It  may,  therefore,  be  unecon- 
omical to  look  for  an  equivalent  linear  circuit  in  order  to 
imitate  the  behavior  of  an  individual  in  his  reactions  to 
complex  sounds. 

Let  us  emphasize  our  point  by  giving  an  example . We 
first  sound  a pure  tone  A,  measure  its  intensity,  and  have 
someone  Judge  its  loudness.  (Let  us  not  enter  here  into  the 
question  as  to  how  one  arrives  at  such  a Judgment;  let  us 
assume  that  we  know  how  to  get  loudness  Judgments  from  our 
subject).  Now  we  sound  a tone  B having  the  same  intensity  as 
A and  we  again  get  a loudness  Judgment  from  our  subject.  It 
is  a rather  simple  matter  to  measure  the  intensity  of  A and  B 
when  they  are  sounded  together.  However,  in  order  to  pre- 
dict the  loudness  of  the  combination,  we  will  need  further 
Information.  How  close  in  frequency  are  tones  A and  B?  Do 
they  beat  with  each  other?  What  is  their  absolute  intensity 
level?  Given  these  data,  we  may  be  able  to  make  a good  guess 
concerning  the  loudness  of  the  combined  pair  of  stimuli  A and 
B.  What,  however,  if  one  of  the  pure  tones  were  replaced  by 
noise?  Then  we  are  dealing  with  a different  problem.  Or 
take  such  phenomena  as  masking  and  auditory  fatigue  which 
are  discussed  below.  They  are  again  examples  of  failure  of 
the  principle  of  superposition  for  responses.  This  principle 
predicts  man's  behavior  over  a rather  narrow  range,  and  must 
be  abandoned,  or  at  best  modified,  as  soon  as  one  goes  beyond 
the  boundaries  of  that  range . 

With  these  words  of  caution,  let  us  proceed  to  examine 
some  of  the  available  data  concerning  the  response  of  people 
to  acoustic  stimuli  such  as  pure  tones,  sharp  clicks,  or 
wide -band  noise. 

Audible  Frequency  Range . The  first  question  we  are 
likely  to  ask  concerns  the  frequency  range  over  which  man  re- 
sponds to  stimuli.  Disregarding  Intensity,  what  is  the  low- 
est frequency  man  recognizes  and  identifies  as  a sound  and 

WADC  TR  52-204  l8 


what  is  the  highest  frequency  at  which  young  people  with 
normal  hearing  can  still  reliably  report  when  the  tone  is  on 
and  when  it  is  off? 

Before  we  answer  these  two  questions  let  us  examine 
briefly  Fig.  l6.1  which  shows  several  curves  for  the  thres- 
hold of  hearing . For  the  moment  we  need  to  concentrate 
only  upon  the  overall  appearance  of  this  family  of  curves, 
and  not  upon  any  details.  We  see  that  man  is  most  sensitive, 
i.e.,  able  to  respond  to  the  weakest  stimuli,  in  the  range 
between  1000  and  3000  cps . As  the  frequency  decreases  below 
1000  cps  or  Increases  above  4000  cps,  a higher  and  higher 
sound  pressure  level  is  necessary  to  obtain  a response. 

Below  50  cps  and  above  15 j 000  cps,  the  curve  becomes 
uncertain.  It  is  rather  difficult  to  produce  really  pure 
tones  of  very  low  frequency  and  it  is  rather  difficult  to 
measure  accurately  the  stimulus  Intensity  of  a tone  above 
6000  or  8000  cps . What  have  these  considerations  to  do 
with  the  answer  to  the  two  questions  we  have  raised  above? 

If  our  low  frequency  stimulus  is  not  pure,  i.e.,  if  it 
contains  harmonics,  we  are  not  sure  to  what  extent  the  sub- 
ject has  responded  to  the  higher  frequencies,  and  to  what 
extent  he  has  responded  to  the  fundamental , At  the  high 
frequency  end  the  situation  is  different.  Here,  too,  we 
must  make  sure  that  the  stimulus  is  really  pure  (the  output 
of  a beat  frequency  oscillator  around  20  kcps  is  often  con- 
taminated by  frequencies  which  are  in  the  range  where  man  is 
most  sensitive).  We  must  also  make  sure  that  (in  view  of 
the  high  intensity  needed  to  get  responses)  the  acoustic 
transducer  does  not  produce  audible  subharmonics . 

If  all  these  precautions  are  taken  and  if  one's  sub- 
jects are  picked  carefully  (i.e.,  young  people  without  any 
history  of  medical  involvement  of  the  ear)  one  arrives  at 
the  conclusion  that  responses  to  airborne  sounds  are  fairly 
reliably  obtained  up  to  frequencies  of  about  20  kcps . 

Recent  experimentation  in  Germany,  England  and  in  this 
country  has  shown  that  people  will  still  say,  *'l  hear  some- 
thing," when  bone -conducted  sounds  of  frequencies  of  as  high 
as  100  kcps  are  presented  at  sufficiently  high  intensity 
levels  10,11,12/.  This  is  not  the  place  to  enter  into  a 
discussion  of  what  people  mean  when  they  say  they  'hear' 
such  sounds.  The  pitch  of  these  "ultrasonic"  stimuli  seems 
rather  vague  and  has  been  reported  to  be  equal  to  the  pitch 
of  the  highest  airborne  sound  the  subject  can  hear.  If  the 
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frequency  of  the  sound  delivered  to  the  skull  Is  varied, 
the  subject  will  localize  the  stimulus  alternately  at  one 
ear  and  then  at  the  other. 

This  discovery  may  force  a more  precise  definition  of 
what  we  mean  by  "hearing"  as  compared  to  responding  or  react- 
ing to  sound.  It  Is  common  knowledge  that  animals  smaller 
than  men  respond  to  higher  frequencies.  A dog  responds  to 
airborne  sounds  up  to  perhaps  40  kcps.  Recent  Investigations 
tell  us  that  the  common  mouse  reacts  to  frequencies  as  high 
as  100  kcps  £ind  the  researches  on  the  navigation  of  bats 
Indicate  that  these  animals  make  very  effective  use  of  highly 
directional  beams  of  high  frequency  sounds  for  the  detection 
of  obstacles. 

At  the  low  frequency  end,  practically  all  our  data 
come  from  Bekesy's  work  2,3/.  His  subjects  were  able  to  give 
a response  to  frequencies  as  low  as  1 cps . Here  again,  how- 
ever, there  Is  some  uncertainty  as  to  whether  the  subjects 
are  really  "hearing"  the  1 cps  tone,  or  whether  they  are 
detecting  It  In  some  other  way.  We  clearly  do  not  react  to 
slow  variations  In  barometric  pressure  as  sound.  Many  text- 
books state  that  man's  lower  frequency  limit  of  hearing  lies 
In  the  range  of  l6  to  20  cps.  In  view  of  Bekesy's  work, 
however,  we  are  probably  Justified  in  selecting  1 or  2 cps 
as  the  lower  frequency  limit. 

Thresholds  of  Hearing.  Now  that  we  know  something  about 
the  frequency  range  over  which  man  responds  to  sounds,  let  us 
examine  In  more  detail  the  range  of  stimulus  Intensities  to 
which  he  reacts . We  have  already  said  that  man  Is  not  equally 
sensitive  to  all  frequencies,  suid  we  might  Inquire  how  much 
energy  or  Intensity  (here  defined  as  the  rate  of  flux  of  energy) 
Is  necessary  to  get  a response  at  the  optimum  frequency.  We 
might  also  ask  what  Is  the  "dynamic  range"  of  man,  and. 
Incidental  to  this  question,  we  might  determine  the  amount  of 
sound  man  will  stand  before  he  seriously  tells  us  that  "this 
Is  painful " . 

Man's  auditory  threshold  Is  commonly  defined  as  the 
minimal  acoustic  energy  In  the  presence  of  which  he  will  say 
"I  hear".  Offhand  It  would  not  appear  difficult  to  establish 
this  value,  but  complications  arise  when  we  examine  the  con- 
cept of  the  threshold  and  the  measurement  procedure  In  more 

detail.  The  naive  concept  of  the  threshold  Is  essentially 
the  following;  There  exists  a value  I^  for  any  acoustic 
stimulus,  such  as  a pure  tone,  that  separates  stimuli  Into 
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two  classes.  Those  stimuli  whose  Intensity*  Is  less  than 
will  never  evoke  a response,  while  those  whose  Intensity  Is 
greater  than  It  will  always  (under  carefully  defined  "normal 
conditions")  evoke  a response. 

The  only  problem  Is  to  determine  I^.  Even  a little 
preliminary  experimentation  shows  us  that  events  are  not  so 
nicely  categorized.  To  be  sure,  there  are  values  of  the 
stimulus  to  which  our  subject  hardly  ever  responds  and  others 
to  which  he  practically  always  responds,  (that  Is,  unless  he 
Is  asleep  or  daydreaming  or  otherwise  "not  paying  attention"). 
But  then,  there  are  stimulus  Intensities  to  which  he  responds 
most  of  the  time  and  others  to  which  he  responds  perhaps  only 
30  per  cent  or  50  per  cent  of  the  time.  In  other  words,  there 
is  a zone  of  uncertainty  that  surrounds  It-  The  organism  Is 
variable  and  does  not  always  behave  exactly  the  same  way  when 
confronted  with  a stimulus  which,  according  to  our  measuring 
instruments.  Is  the  same.  This  Is  not  a serious  difficulty 
If  we  are  prepared  to  apply  a little  statistics . Let  us 
determine  the  so-called  psychometric  curve  that  relates  rela- 
tive frequency  of  response  to  stimulus  Intensity,  and  let  us 
define  It  as  the  Intensity  of  the  stimulus  to  which  the  sub- 
ject responds  50  per  cent  of  the  time.  We  understand,  then, 
that  the  absolute  threshold  Is  not  a fixed  point  on  the 
stimulus  scale . A single  value  representing  the  absolute 
threshold  must  necessarily  be  a statistical  quantity. 

The  problem  that  remains  is  the  measurement  of  I-^  with 
a microphone  13/.  But  where  are  we  to  put  that  microphone? 

In  other  words,  where  and  how  do  we  propose  to  measure  the 
stimulus?  To  make  a rather  Involved  story  relatively  simple, 
let  us  say  that  we  have  several  methods  of  measuring  stimulus 
intensity,  (l.e.,  the  sound  pressure  that  specifies  the 
strength  of  the  stimulation) . We  can  measure  the  sound  field 
as  It  exists  In  the  absence  of  the  subject  and  relate  the 
subject's  "I  hear"  to  that  stimulus  value.  Alternatively  we 
can  measure  the  actual  sound  pressure  at  the  eardrum  by  means 
of  a small  probe  microphone-***  and  can  consider  this  last  look 


* Stimulus  Intensity  Is  here  defined  as  a generic 

term  that  relates  to  the  strength  of  the  stimulus; 
whenever  we  actually  make  measurements  we  specify 
the  sound  pressure  level . 

**  The  probe  microphone  sould  not  by  the  mere  fact  of 
Its  presence  Interfere  appreciably  with  what  you 
want  to  measure.  This  may  not  be  such  an  easy  matter, 
especially  at  high  frequencies . 
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at  the  acoustic  disturbance  before  It  enters  the  organism 
as  our  effective  stimulus.  Both  methods  have  been  used. 

The  first  one  goes  under  the  name  of  minimum  audible  field 
(map)  and  the  second  one  by  the  name  of  minimum  audible 
pressure  (MAP).  Obviously  threshold  curves  plotted  In  terms 
of  MAP  and  MAP  do  not  look  quite  the  same. 

Especially  at  the  higher  frequencies,  there  Is  a good 
deal  of  discrepancy  between  the  two  sets  of  curves  since  It 
Is  at  those  frequencies  that  diffraction  effects  around  the 
head  and  resonance  effects  In  the  outer  ear  play  their  great- 
est role.  The  outer  ear  up  to  the  eardrum  acts  like  a small 
resonator  at  frequencies  for  which  the  wavelength  Is  an 
appropriate  multiple  of  the  distance  to  the  eardrum.  Investiga- 
tions dealing  with  these  diffraction  and  resonance  effects  have 
brought  MAP  and  MAP  values  for  thresholds  Into  reasonably  close 
agreement  14,13/,  except  for  a mysterious  difference  of  about 
6 db.  This  difference  Is  particularly  noticeable  at  low 
frequencies,  and  to  explain  It  has  so  far  taxed  the  Ingenuity 
of  many  an  Investigator  l6,  17,  18/. 

Now  that  we  have  seen  how  It  can  be  specified,  are  our 
troubles  over?  Not  quite . We  still  have  to  settle  upon  an 
experimental  method  for  determining  It.  Are  we  going  to  give 
our  subject  an  oscillator,  an  earphone,  and  an  attenuatpr  and 
tell  him:  "Tell  us  the  minimal  setting  of  this  dial  at  which 
you  cein  hear  a tone  of  1000  cps"?  Or,  are  we  going  to  have 
him  sit  down  In  a chair  and  raise  his  finger  every  time  he 
hears  a tone  while  we  manipulate  the  Intensity  of  the  stimulus 
up  or  down?  Even  though  the  actual  techniques  of  measurement 
of  a stimulus  may  contribute  to  the  variability  of  the .thres- 
hold, It  Is  our  suspicion  that  a good  deal  of  the  disagreement 
on  threshold  values  between  experimenters  In  the  United  States 
and  In  Europe,  for  Instance,  may  be  due  to  differences  In  the 
psychophysical  method  that  Is  used  to  determine  the  thresholds. 
This  may  seem  strange  until  we  reflect  upon  the  fact  that  even 
accurately  calibrated  thermometers  will  Indicate  different 
temperatures  at  points  Intermediate  between  the  calibration 
points.  We  might  even  be  able  to  show  discrepancies  If  we 
use  a single  rod  of  copper  and  If  we  once  measure  the  tempera- 
ture on  the  basis  of  change  In  length  of  the  rod  and  another 
time  on  the  basis  of  a change  In  the  electrical  resistance. 

True  enough,  the  discrepancies  would  be  rather  small  and  we 
would  only  be  able  to  bring  them  out  If  we  had  a precise  way 
of  measuring  temperature.  However,  this  whole  discussion 
emphasizes  a fact  (of  which  anybody  who  has  dealt  with  micro- 
phones Is  quite  aware),  that  data  In  psycho-acoustics,  like 
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results  in  the  physical  sciences,  are  meaningful  only  if 
the  method  by  means  of  which  the  results  are  obtained  is 
specified . 

In  the  section  that  deals  with  audiometry  (See.  16.3) is 
a further  discussion  of  some  of  the  problems  of  how  one 
measures  the  absolute  threshold  of  hearing  in  an  industrial 
or  military  situation  in  contrast  to  a laboratory  situation. 
[For  a detailed  discussion  of  psychophy'slcal  methods  used  to 
measure  thresholds,  see  I.  J.  Hirsh,  The  Measurement  of  Hear- 
ing 12/1. 

Let  us  now  turn  to  the  problem  of  man's  "dynamic  range". 
Figure  16.1  shows  that  under  optimum  conditions,  we  are  able 
to  detect  a sound  with  an  intensity  of  roughly  10"^°  watts/sq 
cm.  This  is  better  than  the  "thresholds"  of  various  high 
quality  microphones . Various  calculations  have  shown  that  a 
further  increase  in  man's  sensitivity  would  make  it  almost 
possible  for  him  to  detect  the  noise  produced  by  the  Brownian 
motion  of  the  molecules  in  the  air  at  normal  temperatures . 
Experimental  evidence  tells  us  that  the  amplitude  of  vibration 
of  the  eardrum  near  man's  lowest  threshold  is  of  the  order  of 
10~^®cm,  or  about  one -hundredth  of  the  diameter  of  a hydrogen 
atom. 


A question  that  we  might  legitimately  ask  is  "How  far 
above  the  absolute  threshold  do  we  have  to  go  before  man  will 
experience  pain?"  We  do  not  have  any  completely  satisfactory 
data  on  the  subject,  but  the  best  available  evidence  indicates 
that  the  range  between  the  lowest  absolute  threshold  and  the 
threshold  of  pain  is  about  l40  db.  That  is  quite  a respectable 
performance,  if  you  compare  it  with  the  performance  of  any 
single  physical  instrument . 

We  have  no  Intention  of  disparaging  the  data  that  have 
been  collected  on  so-called  tolerance  thresholds,  but  there 
are  some  obvious  difficulties  Involved  in  such  a research  pro- 
ject. First  of  all,  there  are  certain  semantic  obstacles  to 
be  overcome . The  older  literature  talked  about  thresholds  of 
feeling  or  of  tickle;  the  most  recent  investigation  (Silverman 
20/)  attempted  to  establish  three  different  contours;  Silver- 
man  called  them,  in  order,  threshold  of  discomfort,  thres- 
hold of  tickle,  and  threshold  of  pain.  The  subjects  were 
Instructed  in  the  meaning  of  these  words,  but  as  the  investiga- 
tion wore  on,  their  standards  shifted  upwards  in  Intensity. 

It  is,  of  course,  difficult  to  tell  if  this  shift  should  be 
attributed  to  an  accommodation  process  in  the  physiological 
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mechanism  or  to  a psychologically  different  interpretation 
of  such  words  as  discomfort  and  pain.  Before  the  investiga- 
tion had  terminated,  the  maximum  sound  pressure  level  that 
the  earphones  could  produce  was  found  to  be  too  low  to 
establish  the  ultimate  thresholds  of  tickle  and  pain.  All 
we  can  say  is  that  for  pure  tones  these  ultimate  values  must 
lie  above  142  db  re  0.0002  dyne/cm^.  The  tolerance  thres- 
holds for  speech  20/  lie  at  130  db  (discomfort),  135  db 
(tickle)  and  above  139  db  (pain). 

We  still  do  not  know  what  the  values  would  be  If  the 
stimuli  had  not  been  presented  by  earphone,  but  If  the  sub- 
ject's head  had  been  placed  in  a homogenejous  sound  field. 

It  may  also  be  worthwhile  to  point  out  that  the  sounds  pre- 
sented to  the  subjects  were  continuous . The  Intensity  was 
increased  in  steps  of  2 db  every  1.5  seconds  between  100  and 
130  db,  and  in  1 db-steps  above  130  db.  This  procedure • avoided 
some  of  the  disagreeable  complications  that  might  have 
accompanied  the  sudden  onset  of  a high  intensity  stimulus.  It 
is,  however,  almost  certain  that  the  tolerance  thresholds  for 
sudden  onset  of  stimuli  would  lie  considerably  below  those 
shown  on  Pig.  l6.2.  Such  thresholds  may  be  influenced  not 
only  by  a startle  reaction,  but  also  by  the  fact  that  it  takes 
a time  Interval  of  the  order  of  l/lOO  sec . for  the  protective 
action  of  the  middle  ear  muscles  to  take  effect . 


Figure  16.2 

These  curves  show  various  determinations  of  toler- 
ance thresholds  for  pure  tones.  Curves  1,  2 and  3 
were  reported  by  Silverman  20/  at  the  Central 
Institute  for  the  Deaf  (CIDTT  in  St.  Louis.  Curve 
1 shows  the  intensity  level  at  which,  after  an  ex- 
tended period  of  getting  used  to  Intense  acoustic 
stimulation,  the  listeners  reported  "discomfort", 
and  Curve  2 marks  the  onset  of  "a  tickling  sensation" 
The  limit  of  the  earphones  was  exceeded  before  some 
of  the  experienced  listeners  complained  of  "pain" 

(Curve  3).  Curve  4 is  the  "threshold  of  feeling" 
obtained  py  Wegel  and  Curve  5 is  Bekesy's  3/ 

threshold  of  tickle.  Bekesy  found  that  at  fre- 
quencies below  15  cps  his  listeners  could  report 
consistently  in  terms  of  two  criteria.  Curves  6 and 
7,  labeled  "touch"  and  "pricking  in  middle  ear"  show 
the  central  tendencies  of  the  Judgments.  (After  J.C.R. 
Licklider  3/) . 
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FREQUENCY 


Effects  of  Age  on  Threshold.  The  auditory  thresholds 
discussed  above  were  usually  determined  for  young  listeners 
who  had  had  experience  In  auditory  tests  In  the  laboratory. 
Audiometric  studies  have  shown,  however,  that  there  Is  a 
progressive  Increase  In  the  threshold  of  hearing  as  a func- 
tion of  age.  Since  the  evidence  on  effects  of  age  has  been 
collected  In  audiometric  studies  rather  than  In  the 
laboratory,  a discussion  of  this  aspect  of  auditory  thresholds 
Is  Included  In  Section  l6.3  on  audiometry. 

Differential  Thresholds . Absolute  thresholds  and 
thresholds  of  tolerance  are  not  the  only  thresholds  we  can 
determine , We  may  wish  to  determine  whether  a listener  can 
distinguish  stimulus  A from  stimulus  B.  The  question  we 
ask  him  may  take  various  forms . We  can  ask  him  If  A and  B 
sound  similar  In  every  respect  or  If  the  stimuli  sound  dif- 
ferent in  any  respect.  We  can  then  make  our  questions  con- 
cerning differences  more  specific  by  asking,  for  Instance, 

If  A sounds  higher  or  lower  than  B.  If  we  were  to  use  pure 
tone  stimuli  we  could  make  tone  A differ  from  tone  B either 
In  frequency  or  in  Intensity  or  both.  We  expect  to  get  from 
the  listener  responses  that  will  yield  a statistical  answer 
to  the  question  of  how  small  a change  In  the  stimulus  he  can 
detect  with  a certain  reliability.  These  so-called  differen- 
tial thresholds  (or  thresholds  of  detectability  of  a change 
in  stimulus  dimensions  or  characteristics)  are  Important 
since  they  are  related  to  the.  listener's  ability  to  discrimin- 
ate one  signal  from  another;  and  Information  Is  usually 
conveyed  by  a set  of  changing  signals. 

There  are  a great  number  of  different  ways  In  which 
differential  thresholds  can  be  determined.  We  assume  that 
we  can  measure  stimulus  Intensity  and  frequency  with  suffi- 
cient accuracy  and  that  we  have  at  our  disposal  an  Infinite 
number  of  ways  in  which  we  can  go  from  stimulus  A to  stimulus 
B.  We  could,  for  Instance,  attempt  to  modulate  the  amplitude 
of  pure  tones  sinusoidally  or  by  means  of  a square  wave,  or 
by  means  of  a triangular  wave,  etc.  Next  we  could  vary  the 
psychophysical  method  that  we  use  to  determine  the  Just 
noticeable  differences* . The  value  that  we  obtain  for  a JND 
will  vary  with  the  way  In  which  we  present  the  stimuli  and 


* Differential  thresholds  are  also  called  difference 
limens  (DL's),  Just  noticeable  differences.  Just 
not  noticeable  differences,  etc.;  Just  noticeable 
differences  are  abbreviated  as  JND. 
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with  the  psychophysical  method.  It  will  also  depend  upon 
the  size  of  the  stimulus  ensemble  and  the  number  of  alterna- 
tive responses  a listener  is  allowed  to  make.  The  size  of 
the  JND  will  not  be  Independent  of  the  time  we  give  our 
listener  to  arrive  at  a determination  of  his  discriminative 
capacity.  Recently  certain  authors  have  reported  "learning 
processes"  22, 23^ 24/  in  this  connection  that  seem  to  con- 
tinue for  years.  It  should  be  fairly  clear,  therefore, 
that  there  is  not  just  one  value  for  a given  Just  noticeable 
difference  (in  the  literature)  but  there  are  quite  a few. 

The  question  is  often  raised  that  the  only  important 
figure  must  be  the  one  that  describes  the  organism's  ulti- 
mate capacity.  Discrimination  of  some  acoustic  signals, 
like  speech,  is  always  performed  in  a certain  time  span  and 
under  certain  conditions  of  interfering  noise.  Since  our 
main  concern  is  with  the  effects  of  noise  upon  man's 
auditory  performance,  we  shall  want  to  measure  the  JND's  in 
a certain  way.  We  may  wish  to  compare  JND's  obtained  under 
certain  standard  conditions  with  JND's  obtained  in  the  pre- 
sence of  a prescribed  amount  of  noise  or  after  exposure  to 
a specified  amount  of  noise. 

Figure  l6.3  shows  how  the  most  widely  accepted  values 
for  difference  thresholds  in  intensity  behave  for  different 
frequencies  and  for  different  sensation  levels.  We  note 
that  the  Just  noticeable  difference  in  sound  pressure  level 
decreases  from  about  3 db  in  the  vicinity  of  the  absolute 
threshold  to  about  0.25  db  at  sensation  levels  of  about 
60  db*.  This  should  not  be  interpreted  to  mean  that  a man 
could  correctly  identify  something  of  the  order  of  250 
intensities  for  a pure  tone  of  a particular  frequency.** 

It  is  one  task  for  a man  to  Judge,  most  of  the  time,  that  a 
1000  cps  tone  of  60  db  sound  pressure  level  is  louder  than  a 
tone  of  59*5  db  sound  pressure  level.  It  is  an  entirely  dlf.- 
ferent  task  for  a man  to  recognize  from  perhaps  250  different 
tones  (all  of  them  1000  cps  but  of  different  sound  pressure 


* For  a definition  of  sensation  level,  see  Chapter  15. 

**  We  arrive  at  the  figure  250  by  dividing  the  auditory 
range  (about  125  db)  by  the  average  JND  in  sound 
pressure  level  (about  0.5  db). 
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levels  spaced  one-half  db  apart)  which  one  is  labeled  59 «5 
db  and  which  one  is  labeled  60  db  sound  pressure  level . 

Riesz's  data  are  by  no  means  beyond  dispute;  but  they 
represent  the  most  comprehensive  set  of  measurements 
attempted  so  far.  They  were  obtained  monaurally  with  the  help 
of  a special  distortion-free  earphone  and  with  the  intensity 
of  the  stimulus  tone  varying  in  a sinusoidal  fashion.  For  a 
discussion  of  some  of  the  factors  Involved  in  determining  a 
DL  in  intensity  see  Stevens  and  Davis  26/.  For  results  that 
are  not  in  agreement  with  Riesz's  see  Q,  V.  Bekesy  27/  and 
also  Dlmmick  and  Olson  28/. 

So  far  we  have  dealt  only  with  Just  noticeable  dif- 
ferences in  the  intensity  of  pure  tones.  But  as  we  have  said 
before,  pure  tones  do  not  occur  very  often  in  our  natural  en- 
vironment and  we  would  much  rather  know  JND's  for  the  Intensity 
of  bands  of  noise  or  of  complex  tones.  We  do  not  possess  data 
on  bands  of  noise  but  we  have  a careful  investigation  of  sen- 
sitivity to  changes  in  the  Intensity  of  white  noise  stimuli. 
Figure  l6.4  shows  Miller's  29/  data  for  two  trained  observers. 
Note  that  the  JND  is  constant  for  sensation  levels  above  20  db. 
This  is  one  of  the  instances  in  which  the  so-called  Weber  law* 
holds . Actually,  the  JND  for  white  noise  is  somewhat  smaller 


* In  1834  Weber  stated  that  the  ratio  of  A.  l/l  (where 
I is  the  stimulus  intensity  and  A I the  Increment  in 
stimulus  intensity)  is  a constant  for  a Just  notice- 
able change  l‘n  intensity.  The  Weber  law  has  often 
been  extended  beyond  the  Intensltiye  domain  and  has 
even  been  applied  to  frequency  discrimination  of  pure 
tones.  Under  these  circumstances  authors  have  re- 
plotted the  Shower -Blddulph  data  (see  Figure  l6.5)  in 
an  attempt  to  show  that  the  A f /f  ratio  stayed  rela- 
tively constant,  at  least  for  frequencies  above  1000  cps. 


Figure  16.3 

Just  noticeable  difference  ^ I in  sound  pressure 
level  for  pure  tones  reported  by  RlesZ  25/.  The 
parameter  is  the  frequency  of  the  pure  tone . I 
represents  the  stimulus  intensity.  The  value  of 
A l/l  is  Indicated  at  the  right.  The  tendency 
ofAl/l  to  be  constant  at  high  sensation  levels 
should  be  noted. 
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than  0.5  db.  This  value  Is  not  too  different  from  Riesz's 
average  values  for  frequencies  between  1000  and  4000  cps . 

Let  us  not  overlook  the  fact  that  Riesz's  and  Miller's  data 
were  obtained  for  different  observers  and  by  different 
experimental  techniques.  We  might  emphasize  the  figure  of 
0.5  db  since  one  finds  so  often  in  the  literature  the  loose 
statement  that  the  unit  of  1 db  was  chosen  because  It  repre- 
sents the  minimum  change  In  Intensity  that  can  be  dis- 
criminated . 

The  size  of  the  JND  for  pure  tones  depends  critically 
upon  the  sensation  level  of  the  stimulus.  It  depends  con- 
siderably upon  the  time  Interval  during  which  the  intensity 
of  the  stimulus  Increases  or  decreases.  This  has  been  shown 
by  Garner  and  Miller  30/  for  short  time  Intervals  and  by 
Lawrence  et  al  31/  for  stimuli  that  last  as  long  as  30  or 
even  60  seconds  with  a change  In  rate  of  up  to  5 db/mlnute. 
Actually,  Intensity  discrimination  seems  to  be  best  for 
sounds  of  about  1 second  duration.  Frequency  discrimina- 
tion on  the  other  hand  does  not  seem  to  be  too  much  affected 
by  Interposing  silent  Intervals  up  to  10  seconds  between  the 
standard  and  the  test  tone  32/  while  the  JND  for  intensity 
Increases  with  the  length  of  the  silent  Interval  between 
standard  and  comparison  tone . 

Let  us  now  turn  to  frequency  discrimination.  The  data 
that  are  most  commonly  accepted  were  taken  by  Shower  and 
Biddulph  33/  (see  Fig.  16.5).  Data  on  JND's  in  frequency 
for  pure  tones  have  recently  been  summarized  in  a manner 
that  Illustrates  some  of  the  variables  that  affect  perfor- 
mance 24/  (Fig.  l6.6).  JND's  In  frequency  pose  essentially 
the  same  problem  as  JND's  In  Intensity.  We  must  distinguish 
between  (l)  ability  to  tell  the  difference  between  two 
stimuli  presented  in  close  succession  and  (2)  ability  to 
Identify  correctly  one  out  of  a large  set  of  stimuli  when 
presented  In  Isolation.  The  former  problem  is  a case  of 


Figure  l6.4 

Just  noticeable  differences  In  sound  pressure  level 
for  white  noise  in  a 7000  cps  band.  Increments  in 
intensity  heard  50  percent  of  the  time  are  plotted 
as  a function  of  the  intensity  of  the  noise  In 
decibels  above  the  threshold  of  hearing.  The  Weber 
fraction  Ck  l/l  remains  essentially  constant  above 
the  20  db  sensation  level.  (After  Miller  29/) 
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relative  discrimination,  while  the  latter  involves  absolute 
identification.  Thus  the  fact  that  we  are  able  to  identify 
a 1003  cps  tone  from  a 1000  cps  tone  does  not  mean  that  we 
can  simply  divide  the  frequency  range  in  cps  by  the  average 
JND  to  find  out  how  many  different  pure  tones  an  average  sub- 
ject can  correctly  Identify.  One  recent  study  using  fre- 
quencies between  100  and  8OOO  cps  showed  that  average  listeners 
gave  results  that  could  be  considered  equivalent  to  perfect 
identification  among  five  tones  only  3^/«  There  are,  of 
course,  sizeable  individual  differences  in  terms  of  more  basic 
concepts.  Though  we  have  no  complete  investigation  of  the 
problem,  there  is  evidence  that  permits  us  to  estimate  that 
people  with  so-called  "absolute  pitch"  have  apparently  learned 
to  identify  correctly  a number  of  tones  perhaps  10  to  20  times 
larger  than  ordinary  subjects.  The  question  still  remains 
open  as  to  whether  subjects  are  able  to  do  better  because 
they  have  some  verbal  labels  to  pin  upon  the  stimuli,  or 
because  they  are  "superior  discriminators"  in  every  respect. 

The  evidence  on  this  latter  point  would  seem  to  indicate 
that  a person  having  "absolute  pitch"  is  not  much  better  than 
the  run-of-the-mill  laboratory  subject,  when  it  comes  to  the 
identification  of  loudness.  (W.  D,  Ward,  Personal  Communica- 
tion) . 


In  the  above  discussion  we  have  considered  discrimina- 
tions of  a relatively  simple  type  involving  changes  of 
intensity  or  frequency  only.  A more  complex  type  of  discrimina- 
tion was  investigated  by  Karlin  35/  who  measured  the  ability 
of  subjects  to  detect  small  changes  in  a noise  spectrum.  The 
chief  characteristics  investigated  were  the  frequency  and 
Intensity  of  noises. 

The  stimuli  were  obtained  by  passing  white  noise,  in  which 
all  components  were  of  uniform  Intensity  level,  through  a band- 
pass filter.  Variations  in  the  pitch  of  the  noise  were  achieved 


Figure  I6.5 

Just  noticeable  differences  in  frequency  for  pure 
tones,  as  obtained  by  Shower  and  Blddulph  33/. 

The  ordinate  is  the  ratio  of  the  JND  to  the  fre- 
quency f,  and  the  parameter  is  the  sensation  level 
of  the  tone  in  dbs . The  frequency  was  varied  in  an 

approximately  sinusoidal  fashion  at  about  2 cps. 
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by  varying  the  sharpness  of  the  cutoff  attenuating  char- 
acteristics of  the  band -pass  filter;  for  example,  higher 
pitch  resulted  from  sharpening  the  high-pass  attenuation 
characteristic,  and  flattening  the  low-pass  characteristic. 
Variations  In  loudness-  were  achieved  by  raising  and  lowering 
the  overall  intensity  of  the  noise.  In  both  tests  a standard 
noise  of  two  seconds*  duration  was  presented  first,  followed 
Instantaneously  by  the  variable  noise,  also  of  two  seconds' 
duration. 

The  changes  In  spectrum  shape  that  were  Just  dls- 
crlmlnable  by  Karlin's  subjects  In  his  pitch  test  are  shown 
In  Pig.  16.7.  The  curves  marked  "lower"  and  higher"  were 
Judged  to  be  lower  and  higher  In  755^  of  the  subjects ' votes . 

In  the  test  of  loudness  discrimination,  the  DL  for  a band  of 
noise  from  500  to  2000  cps  was  about  0.4  db.  Indicating  fair 
agreement  with  Miller's  data. 

Karlin  reported  that  the  pitch  and  loudness  test  appear 
to  measure  significant  abilities  which  are  largely  independent 
of  each  other.  Neither  could  be  used  to  predict  performance 
on  the  other.  He  also  showed  that  there  Is  little  overlap 
between  the  abilities  Involved  In  the  noise  tests  and  those  In 
corresponding  tests  employing  pure  tones  Instead  of  noises . 

Further  experiments  on  differential  thresholds  for  com- 
plex tones  have  been  reported  by  K.  N.  Stevens  36/.  He  mea- 
sured DL's  In  frequency  and  In  bandwidth  for  damped  waves  and 
for  noise  shaped  by  simple  tuned  circuits.  In  general,  his 


Figure  16.6 

Data  on  pitch  discrimination  reported  by  several 
Investigators  are  shown  In  a single  graph.  Data 
obtained  by  different  psychophysical  procedures 
are  shown  by  different  kinds  of  points  on  the  graph. 
The  closed  circles  represent  data  taken  by  the  AX 
method,  and  the  open  circles  represent  DL's  from  the 
ABX  method.  Data  obtained  by  a frequency  modulation 
procedure  are  shown  by  X's,  and  data  from  the  quantal 
method  by  squares . Almost  all  data  lie  within  the 
dashed  lines,  which  Indicate  a spread  of  one  decade. 
(See  Rosenbllth  and  Stevens  24/  for  a discussion  of 
the  AX  and  ABX  methods ) . 
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results  indicate  that  as  the  bandwidth  of  the  stimulus  Is 
broadened,  the  discrimination  of  changes  In  bandwidth  and 
In  frequency  becomes  less  acute. 

We  could,  of  course,  determine  JND/s  for  a great  number 
of  other  variables  such  as  direction  or  distance  of  a sound 
source.  We  could  investigate  the  way  In  which  these  thres- 
holds differ  when  we  manipulate  the  duration  (and  thus  alter 
to  some  extent  the  spectrum)  of  the  sound  In  question.  We 
could  observe  how  a JND  differs  when  the  listening  Is  done 
monaurally  or  binaurally.  We  could  choose  a rather  complex 
stimulus  like  a musical  tone  and  test  for  the  absence  or 
presence  of  certain  harmonics  or  distortions.  While  these 
data  would  be  of  considerable  Interest  from  a psycho-acoustic 
viewpoint,  they  do  not  concern  us  sufficiently  here  to  go  Into 
details . 

Loudness . We  have  now  seen  that  In  trying  to  under- 
stand man's  responses  to  acoustic  stimuli,  we  have  had  to 
measure  such  quantities  as  absolute  thresholds,  tolerance 
thresholds,  and  finally  differential  thresholds  or  Just  notice- 
able differences . These  characteristic  values  permit  us  to 
some  extent  to  evaluate  man's  discriminative  capacities  In 
the  domain  of  hearing.  However,  we  may  want  to  know  more 
than  how  weak  a stimulus  man  can  detect  or  how  strong  a 
stimulus  man  can  stand.  We  may  also  want  to  know  what  rela- 
tions exist  among  the  psychological  responses  evoked  by  various 
loud  or  annoying  noises  whose  physical  characteristics  we  are 
able  to  measure.  For  example,  we  might  wonder  If  B sounds  as 


Figure  16.7 

Just  noticeable  differences  In  the  spectrum  of  a 
noise  as  measured  by  Karlin  35/-  The  curves  marked 
"higher”  and  "lower"  represent  the  noise  spectra 
that  are  correctly  differentiated  from  the  stan- 
dard 75  per  cent  of  the  time . 
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loud  as  A.  More  generally,  we  may  ask  how  the  loudness* 
of  a sound  changes  as  we  change  Its  Intensity.  We  should 
thus  like  to  he  able  to  obtain  psychophysical  scales  that 
will  relate  man's  Judgment  of  the  loudness  of  a sound  to 
Its  sound  pressure  level. 

Most  of  our  discussion  In  the  present  chapter  has 
Involved  single  frequency  sounds  or  random  stimuli  such  as 
thermal  noise . Sounds  of  organized  complexity,  such  as 
musical  tones  or  speech  sounds,  are  different  In  character; 
here  multiple  cues  determine  the  response  to  the  stimulus. 

These  cues  are  not  necessarily  Independent  of  each  other, 
and  hence  are  difficult  to  measure . This  means  that  we 
should  not  expect  simple  psychophysical  relations  to  hold 
for  the  loudness  of  such  complex  tones . 

We  understand  that  we  can  In  general  rate  sounds  from 
soft  to  loud.  It  seems  fairly  obvious  that  sounds  become 
louder  as  their  Intensity  is  Increased**.  However,  we  have 
to  take  certain  precautions  If  we  want  to  establish  a loud- 
ness scale  of  a sort  where  sound  A having  one  unit  of  loudness 


The  American  Standard  Acoustical  Terminology 
defines  loudness  as  the  Intensive  attribute  of 
an  auditory  sensation.  In  terms  of  which  sounds 
may  be  ordered  on  a scale  extending  from  soft  to 
loud.  It  also  states  that  loudness  depends 
primarily  upon  the  sound  pressure  of  the  stimulus, 
but  It  also  depends  upon  the  frequency  and  wave 
form  of  the  stimulus . While  this  definition 
involves  the  term  'sensation'.  It  should  be  clear 
that  we  shall  be  concerned  only  with  measurable 
human  responses  and  their  relations  to  physical 
characteristics . 

There  are  notable  exceptions  to  this  monotomlc 
relation.  The  most  striking  exception  occurs 
for  high  stimulus  intensities  at  low  frequencies 
(perhaps  20  cps  and  below;  see  Bekesy  37/) . 

This  behavior  has  been  attributed  to  the  fact 
that  for  the  large  displacements  involved  the 
ossicles  vibrate  in  .a  different  mode.  In  a manner 
that  reduces  the  actual  stimulus  power  delivered 
to  the  Inner  ear.  (See  Appendix  l). 
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will  sound  one-half  as  loud  as  sound  B of  two  loudness 
units,  and  at  the  same  time,  twice  as  loud  as  sound  C having 
one-half  unit  of  loudness.  The  problem  of  establishing 
such  a scale,  known  technically  as  a ratio  scale  38/,  is  an 
intricate  one . It  is  by  no  means  certain  that  all  the  various 
methods  that  one  can  use  in  order  to  establish  this  scale  will 
yield  the  same  loudness  scale.  This  should  not  surprise  us 
too  much  since  similar  difficulties  (though  most  of  the  time 
not  of  the  same  magnitude)  are  encountered  .whenever  a quantity 
is  measured  by  several  methods . 

Whenever  we  use  natural  numbers  like  those  we  derive 
from  counting  procedures  we  have  to  specify  their  meaning 
rather  carefully.  When  we  see  in  an  advertisement  that 
substance  A is  three  times  as  effective  as  substance  B,  we 
do  not  necessarily  know  how  effectiveness  has  been  defined. 

The  writer  of  the  advertisement  may  have  asked  a group  of 
people  if  substance  A is  more  effective  than  substance  B and 
have  found  that  75  percent  of  the  population  believe  substance 
A is  more  effective,  while  only  25  percent  believe  that  sub- 
stance B is  more  effective.  This  does  not,  of  course,  mean 
that  substance  A is  three  times  as  effective  as  substance  B 
if  we  attach  some  other  criterion  to  the  meaning  of  the  word 
"effectiveness" . 

Let  us,  therefore,  proceed  step-by-step.  First  let  us 
see  if  we  can  establish  equal  loudness  relations  for  pure  tones. 
In  order  to  do  this  let  us  ask  the  following  question.  If  we 
raise  the  sound  pressure  level  of  pure  tones  by  10  db  above 
the  absolute  threshold,  will  these  tones  be  judged  equally  loud? 
Will  the  same  be  true  if  we  raise  the  sound  pressure  level  for 
different  frequencies  by  80  db?  Such  an  experiment  assumes 
implicitly  that  all  tones  are  equally  loud  at  threshold.  This 
is  merely  an  assumption  but  an  eminently  reasonable  one.  As 
long  as  we  add  just  a few  decibels  to  threshold  Intensities, 
we  preserve  our  equal  loudness  relationship  fairly  well,  with 
the  possible  exception  of  the  very  low  frequencies.  However, 
if  we  continue  to  add  an  equal  number  of  decibels  at  all  fre- 
quencies, we  find  that  the  loudness  of  pure  tones  in  the  low 
and  high  frequency  regions  increases  more  rapidly  than  the 
loudness  of  tones  in  the  range  between  1000  and  4000  cps . As 
the  equal  loudness  contours  in  Fig.  16.8  show,  a 1000  cps  tone 
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having  a sensation  level*  of  70  db  Is  Judged  equally  loud 
to  a 100  cps  tone  of  approximately  40  db  sensation  level  (or 
78  db).  However,  a 1000  cps  tone  having  a sound  pressure 
level  of  20  db.  Is  Judged  as  loud  as  a 100  cps  tone  having  a 
sound  pressure  level  of  51  db . An  acceptable  generalization 
for  equal  loudness  contours  might,  therefore,  be  formulated 
as  follows:  near  the  absolute  threshold,  equal  loudness  con- 

tours follow  rather  closely  the  contours  of  equal  sound  pres- 
sure level.  Equal  loudness  contours  measured  In  terms  of 
(1)  sound  pressure  at  the  eardrum,  and  (2)  sound  pressure  In 
a free  field  before  Insertion  of  the  head  are  shown  In 
Figs.  16.8  and  16.9. 

Prom  these  equal  loudness  contours  we  can  already  draw 
the  conclusion  that  In  order  to  Judge  the  loudness  of  a tone 
It  Is  not  enough  to  know  merely  Its  stimulus  Intensity,  but 
we  must  know  Its  frequency  also.  Historically,  the  next 
concept  that  was  Introduced  In  the  development  of  a loudness 
scale  was  that  of  loudness  level . Its  definition  states  that 
the  loudness  level  (In  phons)  of  a sound  Is  numerically  equal 
to  the  sound  pressure  level  In  decibels  (relative  to  0.0002 
dyne/cm2)  of  a 1000  cps  tone  that  Is  Judged  by  the  listeners 
to  be  equally  loud.  This  concept  of  loudness  level  permits 
us.  If  we  are  concerned  only  with  the  loudness  of  a tone,  to 
Ignore  Its  frequency.  All  that  Is  necessary  Is  to  compare 
Its  loudness  with  that  of  a 1000  cps  tone;  If  we  can  measure 
the  SPL  of  the  1000  cps  tone  we  can  then  state  the  loudness 
level  of  the  tone  In  phons. 


* The  sensation  level  (level  above  threshold)  of  a 
sound  Is  the  pressure  level  of  the  sound  In  de- 
cibels above  Its  threshold  of  audibility  for  the 
Individual  observer. 


Figure  16.8 

Contours  of  equal  loudness.  The  sound  pressure 
levels  were  measured  In  a free-fleld  before  entry 
of  the  subject.  (After  Fletcher  39/) • 
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we  have  now  made  several  steps  toward  a genuine  loud- 
ness scale  for  pure  tones.  Next  we  need  a relation  between 
loudness  level  and  loudness,  because  the  loudness  level  of  a 
tone  does  not  yet  tell  us  which  tone  Is  twice  as  loud  as  the 
tone  we  are  Interested  In.  In  particular,  a tone  having  a 
loudness  level  of  20  phons  Is  by  no  means  twice  as  loud  as  a 
tone  whose  loudness  level  Is  10  phons.  A relation  between 
loudness  and  loudness  level  was  obtained  experimentally  and 
a unit  of  loudness  was  chosen  to  label  the  ordinate  (See 
Fig.  16.10).  The  loudness  of  our  reference  tone  of  1000  cps 
at  40  db  SPL  Is  called  one  sone  (or  1000  mllllsones).  We  now 
see  that  a tone  whose  loudness  level  Is  only  20  phons  has  a 
loudness  of  90  mllllsones  while  a tone  whose  loudness  level 
Is  60  phons  has  a loudness  of  5 sones . One  hundred  phons 
corresponds  to  a loudness  of  100  sones . 

Let  us  not  forget  that  the  curve  of  loudness  vs. 
stimulus  Intensity  (SPL)  In  db  for  various  frequencies  would 
not  be  unique . We  have  to  realize  that  the  loudness  vs . 
Intensity  function  Is  different  for  different  frequencies. 

The  single  curve  of  Pig.  I6.IO  Is  the  result  of  the  use  of  the 
Intermediary  concept  of  loudness  level,  and  In  some  sense 
demonstrates  the  usefulness  of  that  particular  concept. 

Let  us  now  consider  the  loudness  of  some  stimuli  more 
realistic  than  pure  tones:  white  noise,  speech,  musical  tones, 

factory  noises.  Jet  noises,  etc.  The  loudness  of  these  sounds 
could  be  evaluated  by  tedious  psychophysical  tests . We  should 
like,  however,  to  be  able  to  estimate  their  loudness  on  the 
basis  of  physical  measurements,  utilizing  the  pure -tone  loud- 
ness curve  that  has  already  been  established.  Some  preliminary 
computational  schemes  for  the  loudness  of  such  sounds  have  been 
suggested  but  we  are  not  yet  on  really  firm  ground.  Several 
Investigators  39^41,42,43/  have  been  concerned  with  measurements 
of  the  loudness  of  complex  tones;  they  found  that  the  loudness 
of  these  complex  tones  can  be  predicted  by  adding  the  loudnesses 
(not  the  Intensities  1 ) of  the  several  components . This  pro- 
cedure falls  If  there  Is  too  much  "interaction"  between  the 


Figure  16.9 

Contours  of  equal  loudness.  The  sound  pressure 
levels  were  measured  at  the  eardrum,  and  an  ear- 
phone was  used  to  deliver  the  tone . (After 
Fletcher  and  Munson  40/,  re -plotted  by  Stevens 
and  Davis  26/) . 
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separate  components . It  would  lead  us  too  far  to  go  Into 
detail  concerning  that  Interaction  but  It  appears  safe  to 
assume  that  loudness  addition  will  fall  when  we  get  Into  a 
region  of  mechanical  nonlinearity  (l.e.,  very  Intense  stimuli). 
Loudness  addition  will  also  fall  If  the  components  of  the  com- 
plex stimulus  lie  sufficiently  close  In  frequency  so  that  they 
compete  for  the  same  physiological  units . 

Pollack  44,  45j^6,47/  has  recently  obtained  loudness 
Judgments  from  several  highly  trained  observers  for  such 
stimuli  as  white  noise,  bands  of  noise.  Interrupted  noise,  and 
speech.  Equal  loudness  contours  for  bands  of  noise  are  shown 
In  Pig.  16,11.  On  the  basis  of  the  present  evidence  It  appears 
that  as  long  as  the  sound  Is  fairly  continuous  In  character, 
and  as  long  as  certain  other  conditions  are  fulfilled,  we  can 
obtain  reasonably  consistent  loudness  Judgments  for  these 
types  of  stimuli  In  laboratory  situations . However,  the 
accuracy  and  consistency  of  these  Judgments  from  highly  trained 
observers  Is  by  no  means  such  that  we  should  expect  to  be  able 
to  make  use  of  their  loudness  scale  for  complex  stimuli  In 
everyday  situations . 

Pollack  obtained  data  In  experiments  In  which  the 
loudness  of  a white  noise  was  Judged  directly.  A comparison 
of  these  data  with  others  derived  from  experiments  Involving 
loudness  transformations  between  the  noise  and  a 1000  cps  tone 
pointed  up  considerable  discrepancies.  Let  us  quote  an  example 
a white  noise  of  100  db  SPL  Is  Judged  directly  to  have  a 
loudness  of  about  700  sones  while  Its  "derived"  loudness  barely 
exceeds  100  sones.  Pollack  points  further  to  the  fact  that, 
while  a given  listener  will  yield  repeatable  results  In  equal 
loudness  matches,  differences  among  listeners  may  reach  as 
much  as  30  db. 

Recently  the  whole  concept  of  a unique  loudness  scale 
has  been  subjected  to  a searching  critique  by  Garner  48/.  In 
a series  of  experiments  he  has  studied  the  extent  to  which 
listeners  vary  In  making  half -loudness  Judgments.  He  has 


Figure  16.10 

Loudness  In  sones  as  a function  of  loudness 
level  In  phons.  Here  the  sounds  are  assumed  to 

be  presented  to  an  uncovered  ear.  (After  Fletcher 
and  Munson  40/) . 
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furthermore,  been  concerned  with  the  problem  of  whether 
listeners  will  generate  the  same  loudness  scale  when  the 
operation  by  means  of  which  they  generate  the  scale  Is  frac- 
tionation, (as  for  example,  half -loudness ) or  bisection  (what 
tone  lies  halfway  between  tone  A and  tone  B In  loudness). 

Pollack  has  also  shown  that  If  thermal  noise  Is 
Interrupted  at  rates  between  2-  and  10  seconds,  one  will 
then  determine  a loudness  that  Is  Judged  greater  than  when 
noise  of  the  same  level  Is  presented  In  a continuous  fashion. 
It  would  appear  that  under  these  circumstances,  the  temporal 
characteristics  of  the  nervous  system  prevent  the  maintenance 
of  a single  monotonlc  relation  between  loudness  and  energy. 

In  spite  of  all  these  difficulties  the  last  two  decades 
have  seen  several  attempts  to  calculate  the  loudness  of  sounds 
directly  from  their  sound  pressure  spectra.  Beranek  et  al  43/ 
discussed  these  attempts  In  a recent  paper  In  the  following 
manner;  ”ln  their  classical  work  on  loudness,  Fletcher  and 
Munson  40/  developed  a method  of  calculation  suitable  for 
determining  the  loudness  of  complex  tones  from  the  frequency 
spectrum.  The  loudness  of  a continuous  spectrum  noise  may  be 
calculated  by  a procedure  developed  by  Fletcher  and  Munson  4l/ 
which  requires  a subjectively  determined  masking  audiogram  as 
starting  data.  Each  of  these  methods  Is  suitable  only  for 
the  particular  kind  of  noise  for  which  It  was  developed,  and 
each  Is  somewhat  cumbersome  to  use . Fletcher  and  Munson  also 
outlined  a procedure  for  calculating  the  masking  audiogram  of 
a continuous  spectrum  noise  given  Its  frequency  spectrum  4l/. 
They  correctly  restricted  their  method  by  the  condition  that 


Figure  l6.ll 

Equal  loudness  contours  for  bands  of  noise 
(bandwidth  about  250  to  300  mels)  as  a function 
of  the  center  frequency  of  the  noise  band.  The 
parameter  Is  the  loudness  level,  or  the  sound 
pressure  level  of  an  equally  loud  1000  cps  tone . 
Listening  was  monaural,  and  the  sound  pressure 
levels  were  measured  Just  beneath  the  cushion 
of  the  earphone.  (After  Pollack  46/) . 
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the  Intensity  I does  not  change  abruptly  from  one  frequency 
region  to  another.  When  one  attempts  to  use  their  method 
beyond  this  restriction,  large  discrepancies  between  cal- 
culated and  subjectively  determined  loudness  values  can  occur. 
For  example,  attempts  to  apply  the  Fletcher -Munson  procedure 
to  the  calculation  of  the  loudness  of  narrow  bands  of  noise 
located  at  low  frequencies  leads  to  loudness  levels  which 
are  lower  than  those  subjectively  determined  by  up  to  25  phons. 

Beranek  and  his  co-authors  expressed  the  opinion  that 
" a simple  objective  procedure  for  determining  the  loudness 
of  any  type  of  noise  with  reasonable  accuracy"  would  be  of 
great  value  In  engineering  applications . On  the  basis  of 
this  evaluation  of  the  situation  and  with  the  hope  of  provid- 
ing a rational  basis  for  a loudness  meter  they  described  an 
equivalent  tone  method  for  the  calculation  of  the  loudness  of 
sounds.  The  main  steps  In  this  Ingenious  procedure  are  as 
follows : 

"l.  Divide  the  noise  spectrum  Into  bands  of  fre- 
quencies, each  at  least  greater  In  width  than  a 
critical  bandwidth  for  hearing,  but  not  greater 
In  width  than  about  600  mels.  Bands  between  300 
and  600  mels  In  width  seem  best. 

2.  Determine  the  sound-pressure  level  In  decibels 
for  each  band. 

3.  By  means  of  the  equal -loudness  contours  for 
pure  tones,  find  the  loudness  level  In  phons  for 
each  band  corresponding  to  the  sound-pressure  level 
o.f  step  2 and  the  mean  frequency  of  the  band. 

4.  Using  the  relation  between  loudness  and  loudness 
level  (Fig.  16.10)  determine  the  loudness  In  sones 
contributed  by  each  band. 

5.  Add  the  Individual  values  of  loudness  to  obtain 
the  total  loudness  In  sones." 

Comparison  of  loudness  values  calculated  by  the  equi- 
valent tone  method  with  experimental  data  obtained  by  Pollack 
44,46/,  Pletcher-Munson  and  others  shows  satisfactory  agree- 
ment. Mlntz  and  Tyzzer  49/  have  used  the  equivalent  tone 
method  as  a basis  for  developing  a loudness  chart  for  octave- 
bands  of  noise.  However,  a really  extensive  empirical  test 
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of  this  method  Is  still  to  come . In  view  of  the  above 
reported  difficulties  with  a loudness  scale  for  pure  tones, 
one  should  not  be  too  optimistic  about  the  chances  of  develop- 
ing a simple  loudness  meter.  There  Is  at  least  some  reason 
for  believing  that  loudness  as  an  aspect  of  man's  response 
to  sound  has  been  over-rated,  otherwise  the  amount  of  varia- 
bility In  loudness  Judgments  would  be  Indicative  of  a rather 
serious  situation. 

In  summary,  we  have  seen  that  we  are  able,  under  certain 
restrictions,  to  set  up  psychophysical  scales  of  loudness  for 
pure  tones  and  noises  of  a more  or  less  continuous  type,  but 
that  we  must  beware  of  extrapolating  those  scales  for  types 
of  stimuli  for  which  they  have  not  been  validated  empirically. 
In  view  of  all  these  findings,  we  are  yet  rather  far  from 
being  able  to  build  a loudness  meter  that  will  Imitate  the 
Judgments  of  the  average  man  over  a realistic  range  of  stimulus 
conditions . A further  word  of  caution  should  be  directed 
towards  those  who  assume  that  loudness  and  annoyance  are  more 
or  less  synonymous  (See  Sections  17-3  and  l8.4). 


Pitch.  Some  of  the  readers  may  be  rather  surprised  that 
up  to  this  point  we  have  talked  very  little  about  the  sensation 
of  pitch.  Pitch  perception  Is  very  often  Identified  with  hear- 
ing and  most  theories  of  hearing  aim  at  explaining  man's 
ability  to  discriminate  pitch. 

What  is  pitch?  The  American  Standards  Terminology 
defines  It  as  "that  attribute  of  auditory  sensation  In  terms 
of  which  sounds  may  be  ordered  on  a scale  extending  from  low 
to  high,  such  as  a musical  scale . " If  we  forget  for  the  moment 
about  the  use  of  the  term  sensation,  we  could  realize  that  the 
preoccupation  with  pitch  stems  largely  from  the  period  in  which 
laboratory  subjects  were  Judging  pure  tone  stimuli.  A pitch 
scale,  the  mel  scale,  has  been  developed  for  pure  tone  stimuli 
and  represents  one  of  the  best  examples  of  a psychophysical 
scale. (See  Pig.  16.12).  Pitch  Is  undoubtedly  an  Important 
aspect  of  auditory  perception  and  yet  most  attempts  to  quantify 
the  pitch  of  complex  sounds  (resembling  the  sounds  we  encounter 
In  nature.  In  everyday  life,  and  even  in  the  vicinity  of  noisy 
engines  and  machines)  have  been  relatively  unsuccessful.  The 
pitch  of  complex  sounds  Is  often  schizophrenic  In  Its  behavior. 
Recently,  workers  In  this  field  have  assigned  at  least  two 
dimensions  to  the  pitch  of  complex  sounds  31,32, 33/. 
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From  the  rather  practical  viewpoint  that  Is  ours  here, 
there  Is  not  much  concern  for  pitch  Judgments  In  the  larger 
field  of  blo-acoustlcs . Pitch  belongs  more  particularly  to 
the  sub-field  of  psycho-acoustics.  There  are  three  areas  in 
which  modified  pitch  perception  might  play  a role  In  the 
phenomena  with  which  we  are  concerned  In  this  handbook. 

There  Is,  first  of  all,  the  observation  that  the  pitch 
of  a pure  tone  is  not  Independent  of  stimulus  Intensity 
53j56/.  However,  this  phenomenon  has  recently  been  shown  to 
be  so  much  affected  by  Individual  differences  that  we  are  not 
left  with  any  particularly  safe  generalization.  There  is, 
secondly,  the  fact  that  the  presence  of  bands  of  noise  modifies 
somewhat  the  pitch  of  pure  tones  57^58,39,60/.  Again  we  find 
a great  deal  of  variability  and  we  are  at  present  not  Inclined 
to  believe  that  this  pitch  shift  Is  particularly  Important 
from  the  viewpoint  of  auditory  communication.  Prom  our  point 
of  view  a most  Important  effect  upon  pitch  occurs  In  the  neigh- 
borhood of  frequency  regions  of  severe  hearing  loss.  Judgment 
of  pitch  is  then  often  badly  Impaired  and  distorted,  and 
tinnitus  may  Interfere  with  pitch  perception  in  a drastic  man- 
ner. We  shall  return  to  this  aspect  when  we  consider  the  after- 
effects of  severe  noise  exposures  and  at  that  time  consider 
If  there  Is  a possible  relation  to  faulty  speech  communication. 

Localization  of  Sounds  In  Space . In  addition  to  absolute 
and  differential  pitch  functions  and  loudness  functions, 
psychologists  have  quantified  several  other  attributes  of 
"auditory  sensation"  such  as  volume,  brightness  and  density. 
However,  there  Is  little  to  make  us  take  an  interest  in  these 


Figure  16.12 

The  pitch  function.  This  curve  shows  how  the 
perceived  pitch  of  a tone  (measured  In  mels) 
changes  as  a function  of  the  frequency  of  the 
stimulus . This  pitch  function  was  determined  at 
a loudness  level  of  60  phons . The  mel  scale  was 
constructed  by  assigning  arbitrarily  the  number 
1000  to  the  pitch  of  a 1000  cps  tone . The  num- 
ber 500  Is  assigned  to  the  pitch  of  a tone  that 
sounds  half  as  high,  etc . (After  Stevens  and 
Volkmann  50/) . 
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attributes  from  the  rather  utilitarian  viewpoint  that  has 
been  ours.  There  is,  however,  one  other  function  that  ought 
to  Interest  us:  how  do  people  localize  sounds  In  space?  Does 

the  fact  that  man  has  two  Input  channels  (ears)  help  him  In 
determining  from  which  direction  a particular  sound  arrives 
and  also  how  far  away  from  him  the  sound  source  is  located? 

Is  man  helped  by  the  fact  that  he  can  move  his  head? 

Let  us  say  at  the  outset  that  we  know  relatively  little 
concerning  the  way  In  which  distance  Is  determined  and  must, 
therefore,  limit  ourselves  to  the  problem  of  identifying  the 
direction  of  a source  of  sound.  The  usefulness  of  this 
ability  is  hardly  In  dispute  and  the  accuracy  with  which  man 
seems  normally  to  be  able  to  do  the  Job  Is  rather  startling 
(if  we  disregard  certain  confusions  In  symmetrical  front  and 
rear  locations ) . 

People  can  localize  sound  sources  Indoors  and  outdoors, 
l.e.,  almost  as  well  In  the  presence  of  reverberation  as  In 
Its  absence  6l,62/.  They  seem  to  use  localization  In  order 
to  enhance  Intelligibility  as,  for  Instance,  when  they  listen 
to  Just  one  conversation  during  a cocktail  party.  (This 
should,  however,  not  be  meant  to  imply  that  this  kind  of 
"filtering".  In  listening  to  one  talker  at  a time.  Is  all  a 
matter  of  localization) . 

Let  us  imagine  a man  whose  head  Is  Immersed  In  a sound 
field,  or  a man  who  is  wearing  a headset  through  which  acoustic 
stimuli  are  being  presented.  Which  aspects  of  the  stimulus  do 
we  have  to  measure  In  order  to  be  able  to  predict  our  subject's 
ability  to  localize  the  direction  of  the  sound  source?  Again 
we  must  distinguish  between  pure  tones  and  the  transient 
"sounds  of  nature".  In  the  case  of  a pure  tone  we  can  define 
a more  or  less  steady-state  situation  for  which  there  Is  a 
definite  amplitude  and  phase  relation  between  the  stimulus  for 
the  right  ear  and  the  stimulus  for  the  left  ear.  In  the  case 
of  a complex  but  steady  sound  the  stimulus  situation  Is 
characterized  by  many  such  relations  (this  may  be  the  reason 
why  such  complex  sounds  are  more  readily  localized  than  the 
laboratory-bred  pure  tones).  Finally  there  are  the  changing 
sounds  which  for  our  present  purposes  we  may  describe  as  a 
series  of  pulses.  In  this  last  Instance  we  can  talk  about  a 
time  relation  only:  the  pulse  reaches  one  ear  earlier  than 

the  other  one . 
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Let  us  now  inquire  how  man  responds  to  these  various 
situations.  In  the  case  of  a pure, steady  tone  the  evidence 
is  strongly  suggestive  of  a double  mechanism.  For  the  low 
frequencies,  the  phase  relation  seems  to  act  as  the  more 
Important  clue  while  for  high  frequencies  the  Intensity  clues 
seem  the  more  Important  63/.  In  between,  around  3OOO  cps, 
both  these  mechanisms  seem  to  feed  conflicting  information 
with  the  effect  that  pure  tones  in  this  frequency  region  are 
rather  poorly  localized.  A little  reflection  on  the  relation 
of  headslze  to  wavelength  phase  information  can  be  of  little 
value  at  very  high  frequencies. 

If  we  now  examine  what  happens  in  the  case  of  pulse- 
llke  sounds  we  find  that  there  are  two  factors  that  we  can 
isolate  and  that  contribute  to  man's  ability  to  point  with 
his  eyes  closed  and  say  "that's  where  the  sound  is  coming  from”. 
First  we  find  that  two  pulses  reasonably  equal  in  intensity 
that  reach  the  same  ear  separated  by  short  time -Intervals  up 
to  several  hundredths  of  a second  are  in  general  judged  to  be 
one,  namely  the  first  one.  The  organism  seems  too  busy  to 
digest  the  "echo-pulse”  (or  even  echo-pulses  if  there  are 
more  than  two  pulses)  to  be  able  to  tackle  the  next  one.  After 
a short  while  the  registers  clear  slowly  and  another  response 
can  be  evoked.  There  is  no  single  time  Interval  that  can  be 
given  for  this  unresponsive  period.  If  the  second  pulse  is 
sufficiently  strong  it  will  be  recognized  at  a shorter  time 
Interval  than  when  it  is  barely  above  the  normal  threshold. 

Now  when  we  feed  our  two  pulses,  not  to  one  ear  but 
the  first  one  to  the  left  ear  and  the  second  one  to  the  right 
ear,  we  find  that  for  really  short  time  Intervals  the  situation 
is  essentially  unchanged.  Our  observer  will  tell  us  that  there 
is  only  a single  sound  and  he  will  localize  it  towards  the  side 
of  his  left  ear  (the  one  that  was  presented  the  first  pulse). 
This  "early-arrlval"  effect  permits  us  to  give  a reasonable 
explanation  for  man's  ability  to  localize  sound  sources  even 
in  rather  reverberant  rooms.  We  can,  of  course,  upset  this 
effect  or  at  least  neutralize  it  by  modifying  intensity  rela- 
tions or  by  adding  a pair  of  pulses  for  the  purpose  of  revers- 
ing the  initial  localization.  On  the  whole,  things  work  out 
pretty  well  and  the  average  man  is  able  to  make  reasonably 
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good  localizations  for  time  Intervals  as  short  as  12  micro- 
seconds* (corresponding  to  a distance  difference  In  air  of 
something  like  0.4  cm!). 


* R.  G.  Klumpp  64/  has  reported  some  recent  mea- 
surements of  dlscrlmlnablllty  of  Interaural 
time  difference . The  abstract  of  his  paper  reads 
as  follows:  "Using  earphone  presentation  and 

electrical  lag  lines  which  allow  precise  control 
of  time  delays,  measurements  were  made  of  Inter- 
aural time  difference  thresholds  for  pure  tones 
and  noise  bands . The  time  difference  threshold 
Is  here  defined  as  the  minimal  change  In  Interaural 
time  difference  which  elicits  a perceptible  shift 
In  the  location  of  the  sound  Image.  Results 
present  the  discrimination  threshold  for  Interaural 
time  delay  as  a function  of  the  psychophysical 
method  of  measurement,  bandwidth  and  center  fre- 
quency of  the  signal.  Under  optimum  conditions, 
the  mean  Interaural  time  difference  threshold  for 
the  five  best  listeners  Is  about  six  microseconds. 
Under  similar  conditions  the  mean  threshold  for 
23  listeners  Is  about  12  microseconds." 
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Masking . Rarely,  except  In  laboratory  situations,  do 
we  listen  to  sounds  In  perfect  quiet,  l.e.,  at  a noise  level 
that  lies  below  our  absolute  threshold.  Most  of  the  time 
we  are  faced  with  sounds  that  Interfere  with  our  reception 
of  the  ’’signal",  l.e.,  of  the  acoustic  stimuli  we  are  trying 
to  discriminate.  This  interference  with  auditory  communica- 
tion by  simultaneous  stimuli  Is  called  masking . The  American 
Standard  Acoustical  Terminology  defines  masking  In  a narrower 
sense  as  "the  amount  by  which  the  threshold  of  audibility  of 
a sound  Is  raised  by  the  presence  of  another  (masking)  sound. 
The  unit  customarily  used  Is  the  decibel’.’  This  definition 
Is  clearly  restrictive  in  that  It  excludes  a large  number  of 
effects  to  which  a masking  sound  gives  rise.  It  Is,  however, 
a realistically  restrictive  definition  In  the  sense  that  we 
have  little  reliable  quantitative  evidence  on  effects  other 
than  threshold  shifts. 

What  other  effects  are  there  besides  the  raising  of  the 
threshold?  Practically  all  the  psycho-acoustic  functions  we 
have  discussed  up  to  this  point  are  affected.  The  presence 
of  a masking  sound  affects  the  way  In  which  we  would  scale  the 
loudness  of  a signal  that  we  are  nevertheless  able  to  detect*; 

It  changes  the  pitch  of  a tone  In  a manner  that  is  not  easily 
quantified  58,39/ ; It  modifies  the  size  of  the  Just  noticeable 
differences**  and,  finally.  It  has  an  effect  upon  localization. 
It  is  safe  to  assume  that  a masking  noise  will  Interfere  with 
what  we  might  call  the  temporal  resolving  power  of  the 
auditory  mechanism,  l.e.,  man's  ability  to  tell  that  two  acousti 
signals  are  separated  In  time.  Almost  all  these  effects  are  In 
need  of  further  investigation  but  their  existence  Is  too  well 
established  to  be  subject  to  doubt. 


The  loudness  of  pure  tones  In  the  presence  of 
noise  has  been  investigated  by  Steinberg  and  Gardner  65/> 
while  Pollack  66/  has  reported  how  the  loudness  of 
speech  Is  affected  by  a background  noise . 

Harris  67/  has  shown,  for  Instance,  that  frequency 
discrimination  suffers  when  the  signal  Is  not  far 
above  Its  masked  threshold.  The  situation  improves 
radically  for  signals  more  than  15  db  above  their 
masked  threshold. 
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It  is  obvious  that  the  amount  of  Interference  or 
Interaction  between  two  simultaneous  stimuli  (the  masked 
stimulus  and  the  masking  stimulus)  will  depend  very  much 
on  what  these  stimuli  are.  Presumably  two  pure  tones  will 
behave  differently  than  a combination  of  a pure  tone  with 
wlde-band  noise.  Finally,  stimuli  having  different  temporal 
characteristics  will  Interact  in  still  a different  fashion. 

As  we  shall  see  in  the  following  paragraphs,  the  story  of 
masking  is  known  only  in  bits  and  pieces.  However,  these 
fragments  fit  together  better  than  in  most  other  areas  of 
psycho-acoustics,  and  in  principle  at  least  we  are  able  to 
predict  rather  well  those  effects  of  masking  which  deal  with 
threshold  shifts.  As  the  presence  of  audible  acoustic  noise 
becomes  a common  occurrence  in  our  lives  we  shall  need  to 
know  more  about  these  other  effects  of  masking  we  described 
above . 

What  is  a common  example  of  masking?  Imagine  that  you 
are  sitting  in  your  living  room  during  a spring  evening.  The 
windows  are  open  and  you  can  make  oUt  the  traffic  noises  from 
a nearby  highway.  However,  these  noises  hardly  Interfere  with 
your  conversation,  which  rolls  along  effortlessly.  Suddenly 
you  hear  an  approaching  airplane.  You  check  your  watch.  He 
is  on  time  tonight.  As  the  plane  approaches,  you  and  your 
friends  have  to  Increase  your  vocal  output  if  you  want  to  be 
understood,  unless  you  want  to  wait  until  the  plane  has 
passed.  The  fact  that  you  have  heard  the  noise  of  the  plane 
hundreds  of  times  before  does  not  alter  the  fact  that  you 
still  have  to  raise  your  voice.  (The  speech  sounds  are  the 
masked  signal  while  the  airplane  noise  is  the  masking  signal 
or  the  masker . ) 


Figure  16.13 

The  masking  of  a tone  (secondary  tone)  by  a 
primary  tone  of  80  db  SPL  and  frequency  1200  cps . 

The  ordinate  gives  the  level  of  the  secondary  tone 
in  decibels  above  the  threshold  level  in  quiet. 

The  abscissa  is  the  frequency  of  the  secondary  tone. 
Regions  in  which  various  combination  s of  tones  are 
perceived  are  noted  on  the  graph.  (From  Wegel  and 
Lane  68/. 
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How  shall  we  describe  this  masking  effect?  Let  us 
think  of  an  analogy.  Imagine  that  you  are  looking  for  a 
flower.  When  It  stands  by  Itself,  It  will  be  easily  dis- 
cernible against  the  background  of  the  earth.  Now  assume 
that  some  grass  has  been  growing  In  the  same  field;  It  Is 
clear  that  the  flower  will  have  to  have  a certain  height 
to  be  visible . It  will  also  help  If  the  color  and  the  shape 
of  the  flower  are  different  from  that  of  the  grass . It  Is 
further  reasonably  obvious  that  only  the  grass  that  grows 
right  .In  the  vicinity  of  the  flower  will  Interfere  with  our 
seeing  the  flower,  provided  the  grass  Is  not  very  tall.  It 
will  certainly  make  a difference  on  what  side  of  the  flower 
the  grass  grows  If  our  field  Is  not  uniformly  Illuminated, 
l.e..  If  we  have  light  and  dark  patches.  It  will  help  fur- 
ther If  we  know  the  flower  we  are  looking  for  Instead  of  being 
told  "go,  look  for  a flower  In  the  lawn".  As  you  read  the 
material  on  masking  that  follows,  especially  the  sections  that 
deal  with  Intelligibility  of  speech  In  the  presence  of  noise, 
you  will  see  for  yourself  how  analogous  these  two  situations 
really  are . 

The  classical  experiments  on  masking  were  performed  In 
the  early  1920 *s  at  the  Bell  Telephone  Laboratories  68/. 

These  experiments  Involved  pure  tones  as  both  masking  and 
masked  stimuli.  They  covered  a wide  range  of  frequencies  and 
Intensities,  but  there  was  apparently  only  one  subject  Invglved. 
Figures  I6.13  and  l6.l4  Illustrate  some  of  the  phenomena  that 
are  encountered  when  pure  tones  are  used  as  both  the  primary 
and  secondary  stimuli . 

How  does  a masking  experiment  proceed?  A subject  Is 
presented  a stimulus  situation  In  which  the  signal  Is  sub- 
merged In  the  "noise",  l.e.,  the  masker  swamps  the  masked 
stimulus . As  the  Intensity  of  the  masked  stimulus  Is  slowly 
Increased,  there  comes  a point  where  the  subject  Is  able  to 
detect  a change . This  change  may  be  attributable  to  any  one 


Figure  16. 14 

The  masking  of  a tone  by  a primary  tone  of 
frequency  800  cps . The  parameter  Is  the  sensa- 
tion level  of  the  primary  tone  In  decibels.  The 
masking  In  decibels  Is  plotted  as  a function  of 
the  frequency  of  the  secondary  tones . (From 
Wegel  and  Lane  68/. 
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of  the  various  masking  effects  that  we  have  described  above, 
l.e..  It  may  be  reported  as  a change  In  loudness,  a change  In 
pitch,  or  even  a change  In  localization.  This  "detection  of 
sinythlng"  criterion  Is  probably  the  safest  we  can  adopt.  It 
points  strongly  to  a parallel  between  experiments  on  masking 
and  experiments  on  JND's. 

The  main  conclusions  to  be  drawn  from  masking  experi- 
ments with  pure  tones  can  be  summarized  briefly  as  follows; 

(l)  there  Is  more  masking  between  tones  that  lie  close  to  each 
other  In  frequency  than  for  tones  that  are  further  apart.  (2) 
a low  frequency  stimulus  Is  more  effective  In  masking  a high 
frequency  stimulus  than  vice  versa. 

As  the  Intensity  of  the  masker  Is  Increased  the  amount  of 
masking  for  various  frequencies  Increases,  but  the  amount  of 
Increase  depends  upon  the  two  frequencies  Involved  (See  Fig. 
I6.l4).  However,  since  the  masking  of  pure  tones  Is  actually 
complicated  by  such  phenomena  as  beats  and  difference  tones 
It  seems  hardly  appropriate  here  to  go  Into  detail  concerning 
these  second-order  effects.  These  second-order  effects  are 
most  clearly  present  at  harmonics  of  the  fundamental  frequency 
of  the  masker.  This  Is  why  they  have  been  attributed  to  so- 
called  aural  harmonics . The  assumption  was  made  that  at 
sufficiently  high  stimulus  Intensities  the  mechanical  struc- 
tures In  the  ear  would  cease  to  function  In  a linear  fashion, 
and  the  deviations  from  linearity  would  give  rise  to  these 
harmonics.  This  explanation  Is  not  unreasonable,  and  there 
are  undoubtedly  stimulus  Intensities  for  which  the  behavior 
of  the  ossicles  ceases  to^follow  Hooke's  law.  There  are,  how- 
ever, the  findings  of  Bekesy  69/  who  -was  apparently  unable  to 
detect  mechanical  vibrations  along  the  cochlear  partition 
corresponding  to  such  harmonics . There  Is  also  the  fact  that 
these  aural  harmonics  make  themselves  felt  at  Intensities 
that  are  so  near  threshold  values  that  the  assumption  of 
non-llnearlty  Is  not  entirely  convincing.  It  may  well  be 
that  the  nervous  system,  a non-llnear  device  par  excellence, 
Introduces  most  of  the  non-llnearlty,  especially  at  low 
stimulus  Intensities . Experimental  evidence  should  soon 
become  available  to  help  us  decide  the  extent  to  which  each 
of  these  two  explanations  Is  valid. 

Recently,  experiments  by  Egan  and  Hake  70/  have  cleared 
up  some  of  the  peculiarities  of  the  Wegel-Lane  data.  Egan 
and  Hake  used  as  their  masking  stimuli  both  pure  tones  and 
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narrow  bands  of  noise  centered  around  the  corresponding  pure 
tone  frequencies . The  pure  tone  maskers  yield  masking  con- 
tours that  are  very  much  like  the  Wegel-Lane  data.  However, 
the  bands  of  noise  give  rise  to  rather  smooth  masking  contours 
from  which  beats  or  difference  tones  are  absent.  Masking 
patterns  for  the  two  cases  are  shown  In  Fig.  I6.15.  The  two 
main  conclusions  we  quoted  above  are  still  valid,  however. 

The  reader  may  wonder  if  we  have  no  information  concern- 
ing some  more  realistic  masking  situations . He  will  find 
information  on  the  masking  of  speech  In  Sec.  16.4.  We  shall 
now  attempt  to  report  what  happens  when  the  masking  noise  Is 
"broad-band  white  noise",  l.e.,  approximately  a 7OOO  cps  band, 
since  that  represents  the  pass  band  of  good  quality  earphones. 
These  masking  experiments  have  been  subjected  to  a thorough 
theoretical  analysis,  and  Important  concepts  such  as  that  of 
the  critical  band  have  had  their  origin  In  these  experiments . 

When  a 7OOO  cps  wlde-band  noise  is  used  as  a masker  for 
pure  tones  In  the  audible  range  (see  Pig.  I6.I6)  we  find  that 
we  have  to  have  an  overall  SPL  of  about  20  db  before  the 
"masked"  thresholds  for  pure  tones  are  appreciably  different 
from  absolute  thresholds  for  normal  subjects . As  the  level  of 
the  noise  is  gradually  raised  we  find  that  It  affects  first 
the  thresholds  for  tones  In  the  most  sensitive  region,  l.e., 
from  1000  to  4000  cps . Further  increase  in  the  Intensity  of 
the  masking  noise  yields  masking  effects  all  through  the 
frequency  range  (up  to  10,000  cps  and  presumably  above.  If 
accurate  measurements  could  be  made  In  that  region) . However, 
for  any  SPL  of  the  noise,  the  middle  frequencies  show  the 
greatest  amount  of  masking,  l.e.,  the  largest  threshold  shift 
In  db.  For  sufficiently  large  stimulus  intensities  of  the 
noise  (about  100  db  overall  SPL)  the  masked  thresholds  for 
pure  tones  from  100  to  5000  cps  are  approximately  equal  In 
SPL,  a situation  that  resembles  our  findings  for  equal  loud- 
ness contours  (Fig.  16.8). 

If  we  replot  the  data  of  Pig.  16.I6  In  order  to  show 
masking  for  pure  tones  as  a function  of  the  SPL  of  the  noise, 
an  Interesting  fact  emerges.  We  obtain  a family  of  curves 
which,  except  for  an  initial  curved  portion,  are  parallel 
straight  lines  of  slope  1 (Fig.  I6.17).  This  means  that  for 
each  additional  db  of  noise  we  will  get  an  additional  db  of 
masking . 
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Data  such  as  these  In  Pig.  16.17  call  for  a generaliza- 
tion: Is  there  a transformation  that  will  permit  us  to  repre- 
sent the  whole  family  of  curves  by  a single  curve?  This  was 
done  by  Hawkins  and  Stevens  71/  when  they  plotted  amount  of 
masking  M vs  the  effective  level  Z of  the  masking  noise 
(Plg.  16.18).  M Is  simply  the  threshold  shift  for  a pure  tone 
due  to  the  presence  of  the  noise,  while  Z Is  the  number  of  db 
by  which  the  total  energy  In  a critical  band  (see  below) 
exceeds  the  threshold  energy  for  a pure  tone  whose  frequency 
Is  at  the  center  of  the  band.  The  critical  band  concept  has 
been  put  to  so  much  use  (and  perhaps  also  some  misuse)  In  the 
literature  on  masking  that  It  will  be  worthwhile  to  explain 
It  In  some  detail . 

In  the  early  1930 's  Fletcher  and  Munson  40,4l/  were 
interested  In  the  loudness  of  complex  tones  and  In  the  rela- 
tion between  loudness  and  masking.  For  their  experiments 
they  generated  a random  noise  capable  of  producing  approxi- 
mately the  same  amount  of  masking  at  each  frequency.  Most  of 
their  work  was  related  to  a hypothetical  model  of  the  basilar 
membrane.  It  was  -thought  at  the  time  that  much  of  man's 
behavior  In  response  to  acoustic  stimuli  could  be  explained 
in  terms  of  what  was  supposed  to  happen  along  the  basilar 
membrane*.  Whatever  the  validity  of  this  physiological  model 
in  the  light  of  our  present  knowledge,  an  Important  generaliza- 
tion emerged  from  this  physiological-behavioral  cauldron:  the 

concept  of  critical  band  39/.  This  concept  Is  based  upon  the 
double  assumption:  (l)  only  the  frequencies  that  lie  within 


Since  that  period,  Bekesy's  69,72,73/  researches 
have  given  us  a more  realistic  picture  of  what  goes 
on  Inside  the  cochlea. 


Figure  16.15 

Masking  produced  by  two  stimuli,  one  a pure  tone 
of  400  cps  frequency  and  the  other  a narrow  band 
of  noise  centered  at  4l0  cps . The  masking  In 
decibels  Is  plotted  as  a function  of  frequency. 
(From  Egan  and  Hake  10/ . 
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a narrow  band  centered  at  the  frequency  of  the  tone  that  is 
being  masked  contribute  to  the  masking,  and  (2)  when  a tone 
is  Just  audible  against  the  noise,  the  energy  in  the  critical 
band  is  equal  to  the  energy  of  the  tonal  stimulus . 

There  has  been  much  debate  concerning  the  validity  of  the 
concept  of  critical  band.  In  particular,  the  curve  that 
represents  the  width  of  the  critical  band  as  a function  of 
frequency  has  been  under  scrutiny.  Neither  Egan  and  Hake  70/ 
nor  Shafer  and  his  co-authors  7^/  were  able  to  show  perfect 
agreement  with  a previously  published  curve.  However,  one 
ought  to  be  careful  about  interpreting  the  critical  band  con- 
cept too  literally.  Some  have  gone  as  far  as  to  consider  the 
critical  bands  the  contemporary  heirs  to  the  resonating  struc- 
tures in  the  inner  ear  that  throughout  the  last  century  explained 
man's  ability  to  discriminate  pitch. 

Critical  bands  permit  us  to  predict  to  a first  approxi- 
mation the  amount  of  masking  for  pure  tones  in  the  presence  of 
well-behaved  noises,  i.e.,  noises  having  uniform,  smooth 
spectral  and  temporal  characteristics.  The  critical  bands  are 
undoubtedly  related  to  the  general  mechanism  of  frequency- 
discrimination.  However,  the  fact  that  the  Shower -Biddulph 
DL's  for  frequency  are  proportional  to  the  critical  bands  has 
been  overstressed.  Whatever  the  critical  bands  are,  they  do 
not,  as  Bekesy's  data  show,  represent  the  sharpness  of  the 
resonance  curves  of  the  basilar  membrane . 


Figure  l6.l6 

Contours  showing  the  monaural  thresholds  for  pure 
tones  when  masked  by  white  noise  at  various  inten- 
sities . The  parameter  is  the  approximate  level 
of  the  masking  noise . The  sound  pressure  levels 
of  the  pure  tones  are  derived  from  the  calibration 
data  of  an  earphone  into  a 6-cc  cavity.  The  dotted 
portions  indicate  the  nonuniform  earphone  response 
at  high  frequencies.  The  solid  curves  are  corrected 
for  this  nonuniform  response . (Prom  Hawkins  and 
Stevens  7l/. 
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A final  word  concerning  critical  bands  for  one  ear 
versus  critical  bands  for  two  ears:  two  ears  behave 

definitely  ^ they  were  more  sharply  tuned  than  one . 

The  width  of  the  critical  bands  for  one  and  two-ear  listen- 
ing are  shown  in  Pig.  I6.I9.  Hirsh's  findings  on  the  masked 
threshold  for  low-frequency  tones  as  a function  of  Interaural 
phase  would  further  modify  these  critical  band  functions  76/. 

Let  us  be  satisfied  with  the  fact  that  over  a certain  range 
of  man's  behavior,  the  critical  band  model  (a  mathematical 
and  not  a physical  model  or  even  a physiological  "reality”) 
represents  a useful  predictive  device. 

Not  all  the  data  involving  pure  tones  and  noise  deal 
with  steady  pure  tones  and  steady  noise.  Garner  and  Miller 
78/  have  investigated  the  masked  threshold  of  pure  tones  as 
a function  of  duration,  and  Miller  and  Garner  79/  have 
studied  the  masking  of  tones  by  repeated  bursts  of  noise . 

Hirsh,  Rosenbllth  and  Ward  80/  have  investigated  the 
masking  of  brief  clicks  by  pure  tones  and  bands  of  noise. 

In  such  experiments  new  elements  enter  the  situation  involving 
the  temporal  characteristics  of  both  masking  and  masked  stimuli. 
These  experimental  data  exhibited  greater  variability  among 
subjects  with  normal  audiograms  than  is  customary  in  masking 
experiments,  which  are  usually  characterized  by  somewhat  less 
variability  than  absolute  or  differential  threshold  measure- 
ments . 

Stevens  36/  has  described  some  masking  experiments  in 
which  sounds  of  a more  complex  nature  were  used  as  either 
maskers  or  masked  stimuli.  He  generated  a complex,  buzz-like 
tone  by  excitation  of  a single  electrical  resonant  circuit 
with  a periodic  (125  cps)  sequence  of  pulses.  Such  a stimulus 
is  similar  in  nature  to  a vowel  sound,  which  is  generated  by 
periodic  excitation  of  the  resonant  vocal  cavities.  Two  groups 


Figure  16.17 

Relation  between  the  masking  of  pure  tones  of 
various  frequencies  and  the  level  of  the  masking 
noise.  Two  abscissas  are  shown.  One  indicates  the 
overall  SPL  of  the  masking  noise  in  a 7OOO  cps  band, 
and  the  other  Indicates  the  SPL  per  cycle  of  the 
masking  noise.  (From  Hirsh  I9/,  after  Hawkins  and 
Stevens  71/) . 
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of  masking  experiments  were  performed:  (l)  the  buzz  sound 

was  used  to  mask  pure  tones,  and  (2)  pure  tones  and  bands 
of  noise  were  used  to.  mask  the  buzz  sound. 

At  the  beginning  of  this  section,  we  said  that  we 
would  deal  only  with  effects  attributable  to  the  simultaneous 
presence  of  a signal  and  a noise.  Lately  Investigators  have 
lumped  after-effects  of  short  duration  under  the  heading  of 
masking  (see  de  Mare's  "After-Effect  Masking"  8l/  and  Munson 
and  Gardner's  "Residual  Masking"  82/.  Without  desiring  to 
enter  here  Into  a semantic  argument,  let  us  state  that  we 
will  deal  with  those  observations  In  the  next  chapter.  This 
postponement  does  not  mean  that  after-effect  phenomena  are 
any  less  basic;  the  division  of  the  material  Is  merely  a 
matter  of  convenience. 


Figure  16.18 

Relation  between  the  masking  (M)  and  the  effec- 
tive level  (Z)  of  the  masking  noise.  Z Is  the 
number  of  decibels  by  which  the  total  energy 
In  a critical  band  exceeds  the  threshold  energy 
for  a pure  tone  whose  frequency  Is  at  the  center 
of  the  band.  (Prom  Hawkins  and  Stevens  7l/) . 
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1-6.3  Audiometry 

In  its  broader  sense,  audiometry  means  "measurement 
of  hearing"  Usually,  however,  the  term  Is  used  to  describe 
a restricted  kind  of  auditory  measurement,  namely  the  mea- 
surement of  the  absolute  threshold  for  pure  tones  and  for 
speech.  The  Instrument  used  to  perform  this  kind  of  measure- 
ment Is  called  an  audiometer . 

In  Section  16.2  we  discussed  In  some  detail  how 
thresholds  for  pure  tones  can  be  measured  In  the  laboratory. 

We  have  shown  (Fig.  l6.1)  curves  of  minimum  audible  field 
and  minimum  audible  pressure  measured  by  several  Investigators 
In  careful  laboratory  experiments . These  are  the  threshold 
curves  for  young  subjects  with  acute  hearing.  These  thres- 
hold curves  are.  In  a sense,  examples  of  experimental  audio- 
metry In  contrast  to  clinical  audiometry.  Hirsh  19/ 
distinguishes  between  these  two  kinds  of  measurement: 
experimental  audiometry  Is  concerned  with  the  measurement  of 
thresholds  in  terms  of  the  physical  dimensions  of  the  stimulus, 
whereas  clinical  audiometry  has  to  do  with  the  measurement  of 
hearing  loss,  i.e.,  the  difference  between  a particular  thres- 
hold and  a comparable  threshold  norm. 

The  need  often  arises  for  measurements  of  the  hearing  of 
an  Individual  In  a non-laboratory  situation.  To  evaluate  the 
hearing  of  this  Individual,  we  should  like  to  compare  such 
measurements  with  those  for  Individuals  with  normal  hearing. 

The  measurement  Is  most  often  made  In  a room  that  Is  not 
absolutely  quiet,  such  as  a doctor's  office,  where  the  ambient 
noise  is  likely  to  Interfere  somewhat  with  the  measurements . 

The  person  operating  the  audiometer  is  usually  not  skilled  In 
the  techniques  of  psychophysics,  and  the  subjects  are  certainly 
not  trained.  Such  clinical  measurements  clearly  fall  Into  a 
class  entirely  different  from  the  laboratory  measurements 
discussed  previously.  We  must  devise  special  techniques  that 


Figure  16.19 

Critical  bands  for  one-  and  two-ear  listening. 
The  left-hand  ordinate  gives  the  bandwidth  In 
decibels,  as  10  logxo  ^ The  right -hsind 

ordinate  Is  a f In  cps . (From  Beranek 
after  French  and  Steinberg  77/) • 
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are  suited  to  the  clinical  situation  In  order  to  be  sure 
that  meaningful  results  can  be  obtained. 

To  the  otologist  and  audiologist,  audiometry  Is  a 
useful  aid  In  the  detection,  diagnosis  and  correction  of 
diseases  of  the  ear.  The  usual  type  of  measurement  consists 
In  determining  the  absolute  threshold  for  pure  tones.  The 
data  are  plotted  as  hearing  loss  relative  to  the  threshold 
norm.  Often,  however.  It  Is  necessary  to  measure  the  amount 
by  which  the  hearing  of  an  individual  differs  from  normal 
hearing  In  order  to  evaluate  the  extent  to  which  his  loss  of 
effectiveness  as  a member  of  society  is  influenced.  There 
are  many  ways  In  which  partial  or  complete  loss  of  hearing 
may  adversely  Influence  our  effectiveness  In  society.  Our 
ability  to  understand  speech  may  be  impaired;  our  ability  to 
perceive  distance  and  direction  may  be  influenced;  our  enjoy- 
ment of  music  and  other  sounds  may  be  affected;  our  well- 
being or  the  well-being  of  others  may  be  Influenced  by  our 
Inability  to  hear  sounds  of  warning  or  distress. 

Classically  the  accepted  audiometric  technique  has 
been  the  measurement  of  the  absolute  threshold  for  pure  tones 
relative  to  that  of  a normal  population.  Before  the  advent  of 
the  electronic  audiometer,  tuning  forks  and  tests  Involving 
spoken  and  whispered  speech  were  often  used  to  measure  hearing 
acuity.  It  Is  reasonably  obvious  that  our  ability  to  perceive 
speech  Is  related  to  the  amount  of  hearing  loss  for  pure  tones. 
However,  the  exact  form  of  this  relation  Is  not  completely 
understood  at  present . It  would  lead  us  too  far  astray  to  go 
into  the  reasons  for  this  difficulty  at  this  point . Suffice 
It  to  say  that  a given  amount  of  hearing  loss  may  affect 
speech  perception  very  differently  depending  upon  the  nature 
of  the  loss  19/.  (See  below  for  a discussion  of  various  kinds 
of  loss) . 

Fletcher  (1929  83/  and  1953  84/)  and  Carhart  85/  have 
derived  relations  between  the  pure  tone  threshold  and  the 
reception  of  speech.  In  1942  The  American  Medical  Association 
sponsored  a method  for  calculating  the  "percentage  hearing  loss 
for  speech”  from  the  audiogram.  This  method  has  been  used 
widely  In  recent  years,  but  Its  nature  has  been  misunderstood 
by  some  who  have  used  It.  At  present,  an  attempt  Is  being 
made  to  define  medico-legal  standards  In  which  hearing  loss  for 
pure  tones  at  certain  frequencies  is  related  to  a measure  of 
socially  useful  hearing  for  speech. 
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Recently,  as  psycho-acousticians  have  developed 
quantitative  techniques  for  testing  the  Intelligibility  of 
speech,  considerable  emphasis  has  been  placed  on  speech 
audiometry.  Many  Investigators  believe  that  auditory  mea- 
surements that  Involve  speech  stimuli  directly  are  more  closely 
correlatable  with  social  adequacy  86/  and  other  related  factors 
than  are  pure  tone  measurements. 

The  use  of  speech  as  the  test  material  has  certain 
disadvantages,  since  the  Intelligence,  training,  and  even 
regional  background  of  the  subject  plays  a significant  role 
In  determining  his  response  to  speech  Items.  Perhaps  an  audio- 
metric  technique  which  measures  the  perception  and  discrimina- 
tion of  speech-llke  sounds  (such  as  damped  waves,  clicks,  bands 
of  noise)  would  constitute  a meaningful  and  at  the  same  time 
practicable  method  for  measuring  hearing  loss . 

The  discussion  which  follows  Is  limited  chiefly  to  pure- 
tone  alr-conductlon  audiometry  and  speech  audiometry.  The 
results  of  some  large-scale  audlometrlc  surveys  follow  the  sec- 
tion on  pure-tone  audiometry,  since  those  surveys  were  helpful 
In  establishing  the  normal  audiometer  threshold  and  In  estima- 
ting the  expected  deviations  from  this  threshold.  In  particular, 
the  shift  In  the  audiometer  threshold  with  age  Is  discussed  In 
some  detail . 

Audlometrlc  procedures  that  are  useful  only  In  the 
diagnosis  of  hearing  defects  and  In  the  screening  of  groups  of 
subjects  are  not  emphasized  since,  for  the  purposes  of  this 
handbook,  they  are  of  little  Interest. 

Pure -Tone  Audiometry.  The  hearing  loss  for  pure  tones* 

Is  generally  measured  at  each  of  several  frequencies,  usually 


The  discussion  In  this  section  Is  restricted  to 
the  determination  of  thresholds  for  air  conduction 
for  which  the  stimulus  Is  applied  by  means  of  an 
earphone.  In  the  diagnosis  of  certain  hearing  de- 
fects, bone -conduction  thresholds  are  occasionally 
used.  Skilled  otolglsts  can  use  these  tests  to 
good  advantage  for  purposes  of  diagnosis . These 
test  results  are,  however,  still  more  difficult  to 
Interpret  In  terms  of  the  understanding  of  a speech 
than  the  results  of  pure  tone  audiometry  using  air 
conduction . 
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covering  the  range  from  125  to  8000  cps.  Hearing  loss  is 
expressed  as  the  number  of  decibels  by  which  the  intensity  of 
a pure  tone  must  be  raised  above  a "normal"  threshold  intenslly 
in  order  that  it  will  elicit  a response  from  an  observer.  Ear- 
phones are  invariably  used  to  present  the  stimulus  in  turn  to 
each  ear  of  the  observer. 

In  Section  l6.2  a discussion  is  presented  of  the  absolute 
threshold  of  hearing  as  determined  from  laboratory  measurements. 
Two  types  of  threshold  are  defined:  a minimum  audible  field 

(MAF),  in  which  the  sound  pressure  is  measured  at  the  location 
of  the  center  of  the  head  in  a free -field  before  the  head  is 
Inserted  in  the  field,  and  a minimum  audible  pressure  (MAP)  in 
which  the  sound  pressure  is  measured  at  the  eardrum  by  means 
of  a probe  tube.  The  MAP  and  MAP  threshold  curves  are  shown 
in  Pig.  l6.1.  Neither  one  of  these  curves  is  used  as  the 
"normal"  threshold  for  an  audiometer. 

Usually  audiometric  measurements  are  not  made  under  ideal 
laboratory  conditions,  where  great  care  can  be  taken  to  minimize 
the  ambient  noise  level  and  to  use  well-trained  subjects.  In 
many  spaces  where  audiometers  are  used  the  ambient  noise  level 
is  sufficiently  high  at  low  frequencies  to  mask  the  low  fre- 
quency tones  generated  by  the  audiometer  near  the  threshold 
level.  The  masking  at  medium  and  high  frequencies  (above  500 
cps)  is  usually  small,  however,  since  the  spectrum  of  the 
ambient  noise  drops  off  rapidly  at  high  frequencies . Another 
difficulty  in  clinical  audiometry  is  that  probe-tube  calibra- 
tions are  difficult  to  perform,  and  are  subject  to  some 
variability  from  observer  to  observer  because  of  differences 
in  fit  of  the  earphone  and  differences  in. ear  canal  dimensions. 
Consequently,  the  "normal"  audiometer  threshold  is  defined  in 
terms  of  the  sound  pressure  developed  by  the  earphone  in  a 
standard  coupler  when  the  audiometer  is  set  at  zero  decibels 
hearing  loss . 

In  its  standards  on  "Minimum  Requirements  for  Acceptable 
Pure-Tone  Audiometers  for  Diagnostic  Purposes",  the  American 
Medical  Association  has  specified  a National  Bureau  of  Standards 
Coupler  9-A  or  its  acoustic  equivalent  87/,  and  has  specified 
the  threshold  sound  pressures  developed  in  that  coupler  for 
several  types  of  earphones.  Por  a Western  Electric  Type  705-A 
earphone,  the  threshold  pressures  are  given  in  Table  I6.I.  Por 
comparison,  the  MAP  and  MAP  data  of  Slvlan  and  White  1/  are 
also  shown  in  the  Table..  (It  is  interesting  to  note  that  the 
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TABLE  16.1 


AMA  STANDARD  THRESHOLD  SOUND  PRESSURES  FOR  AUDIOMETERS  USING 
W.  E.  TYPE  705-A  RECEIVER.  MAF  AND  MAP  THRESHOLDS  OF  SIVIAN 
AND  WHITE  ARE  SHOWN  FOR  COMPARISON 


Frequency 

cps 

Threshold  Sound 

Pressure  In  db  re 

1 dyne/cm^ 

AMA  Standard 

MAF 

MAP 

125 

-19.5 

-47 

-33 

250 

-34.4 

-58 

-49 

500 

-49.2 

-67 

-60 

1000 

-57.3 

-70 

-67 

2000 

-57.0 

-76 

-69 

4000 

-58.9 

-80 

-66 

8000 

-53.1 

-64 

-55 

difference  between  the  MAP  and  the  AMA  standard  is  as  much  as 
27  db  at  low  frequencies).  The  normal  threshold  specified  in 
this  fashion  by  the  American  Medical  Association  was  established 
from  measurements  of  thresholds  for  a large  number  of  ears  of 
persons  In  the  age  group  of  I8  to  30  years  inclusive* . 

The  hearing  loss,  therefore.  Is  the  number  of  decibels  by 
which  the  Intensity  of  a pure  tone  must  be  raised  above  the 
normal  threshold  intensity  defined  In  Table  16.I.  In  a sense, 
the  pure-tone  hearing  loss  of  an  Individual  Is  equal  to  the 
sensation  level  of  the  pure  tone  for  an  AMA  standard  observer. 
The  data  for  a given  subject  are  usually  presented  in  the  form 
of  an  audiogram,  which  Is  a plot  of  the  hearing  loss.  In 


* A further  discussion  of  the  problems  Involved  In 
establishing  a normal  threshold  for  audiometers  Is 
given  later  under  the  effects  of  age . 
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decibels,  as  a function  of  frequency.  A typical  audiogram 
is  shovm  in  Pig.  16.20. 

Users  of  audiometers  employ  many  different  techniques 
for  the  measurement  of  hearing  loss,  and  at  present  there 
are  no  accepted  standard  procedures.  For  example,  the  defini- 
tion of  hearing  loss  given  at  the  beginning  of  this  section 
falls  to  specify  how  the  pure-tone  stimulus  is  presented  to 
the  observer  during  the  test,  or  how  the  response  of  the 
observer  Is  measured  and  evaluated.  Questions  of  instrumenta- 
tion, environment  and  psychophysical  procedure  are  discussed 
below. 

The  stimulus  is  generated  by  a pure-tone  audiometer, 
which  comprises  three  units:  (l)  an  electronic  oscillator 

for  generating  alternating  electric  currents  of  the  desired 
frequencies-,  (2)  an  amplifier  with  an  attenuator,  and  (3)  an 
earphone  for  applying  the  sound  to  the  listener's  ear.  The 
sound  pressure  level  of  the  sound  Is  adjusted  by  means  of  the 
attenuator  dial,  which  is  calibrated  In  units  of  hearing  loss 
(decibels).  The  frequency  of  the  sound  Is  selected  by  means 
of  another  control  which  is  marked  In  cycles  per  second.  An 
audiometer  should  be  equipped  with  a switch  for  turning  the 
tone  on  and  off  without  Introducing  audible  clicks . 

The  Council  of  Physical  Medicine  and  Rehabilitation  of 
the  American  Medical  Association  and  the  American  Standards 
Association  have  written  specifications  on  minimum  require- 
ments for  acceptable  pure-tone  audiometers  for  diagnostic 
purposes.  The  "normal"  threshold  sound  pressures  contained 
In  these  specifications  have  been  tabulated  above . In  addition 
the  requirements  specify  test  frequencies.  Intensity  Intervals 
and  ranges,  as  well  as  acceptable  harmonic  distortion  and 
noise  present  In  the  signal.  The  Standards  also  provide  a list 
of  accepted  commercial  audiometers . 

Testing  techniques  for  the  measurement  of  the  hearing 
loss  for  pure  tones  have  been  suggested  by  several  Investigators, 


Figure  16.20 

A typical  audiogram.  The  hearing  loss  In  decibels 
Is  plotted  at  each  of  several  frequencies . 
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and  a review  of  several  procedures  has  been  presented  by 
Hirsh  19/.  Reger  88/  has  given  a discussion  of  some  of  the 
factors  which  influence  the  accuracy, of  the  measurement,  and 
has  proposed  a standardized  technique  which  is  fairly  close 
to  that  already  followed  by  many  users  of  audiometers. 

Most  of  the  factors  which  influence  the  accuracy  of 
auditory  thresholds  may  be  classified  under  the  following 
three  subdivisions:  (l)  variables  associated  with  the 

environment,  (2)  variables  associated  with  the  subject,  and 
(3)  variables  Introduced  by  the  user  of  the  audiometer. 

Of  the  environmental  factors,  the  level  of  the  ambient 
noise  in  the  testing  room  is  usually  the  most  difficult  to 
control.  It  is  essential  to  maintain  low  noise  level  in  the 
room.  The  noise  usually  present  in  a building  interferes 
primarily  with  the  detection  of  the  low-frequency  tones,  and 
does  not  appreciably  affect  detection  of  the  high  frequencies. 
The  noise  level  would  be  considered  sufficiently  low  if  an 
individual  with  known  normal  hearing  were  able  to  perceive 
each  frequency  at  the  zero  level  on  the  audiometer. 

Suppose  we  wish  to  assess  the  hearing  loss  of  an 
individual  or  group  of  individuals  on  the  basis  of  measure- 
ments with  a given  audiometer  in  a given  environment . In  order 
to  Interpret  our  data  properly,  we  should  like  to  know  how  the 
normal  population  would  test  with  the  same  audiometer  and  the 
same  environment.  In  practice,  audiometric  data  should  be 
obtained  for  a control  group  that  is  believed  to  be  representa- 
tive of  the  normal  population.  (See  Williams  and  Cox  89/). 

Other  variables  associated  with  the  environment  are  those 
which  Influence  the  body  comfort  and  alertness  of  the  observer, 
and  Influence  his  ability  to  relax  during  the  test.  These 
include  the  temperature,  humidity,  and  barometric  pressure  or 
altitude  of  the  test  room,  and  the  time  of  day.  The  earphone 
should  be  well-fitting,  but  should  not  press  so  hard  against 
the  observer's  ear  that  it  is  uncomfortable. 

Psychological  factors  associated  with  the  subject  include 
his  intelligence,  reaction  time  and  previous  test  experience, 
his  physiological  condition  and  his  attitude  toward  the  test. 
Additional  test  difficulties  are  presented  by  subjects  who 
experience  head  noises  such  as  tinnitus.  Malingering  and 
psychogenic  disorders  fall  into  still  another  category. 
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The  training,  experience,  insight  and  personality  of 
the  person  administering  the  test  are  important  factors . Of 
primary  importance  are  the  techniques  he  uses  to  present  the 
stimulus  and  his  evaluation  of  the  subject's  response.  For 
example,  the  stimulus  may  be  presented  as  a continuous  tone 
or  as  a tone  Interrupted  at  various  repetition  rates,  and  the 
intensity  of  the  tone  may  be  either  increased  or  decreased 
gradually  during  determination  of  the  threshold.  The  value 
of  the  threshold  may  be  markedly  influenced  by  these  and 
other  factors . 

First  let  us  examine  the  general  psychophysical  pro- 
cedures that  are  available  for  the  measurement  of  the  auditory 
threshold.  The  method  of  ad.lustment  may  be  used  in  which  the 
observer  himself  adjusts  the  intensity  of  the  stimulus  by 
means  of  an  attenuator  until  the  tone  is  "Just  audible”. 

Usually  the  median  of  several  determinations  of  the  threshold 
with  both  ascending  and  descending  intensity  is  taken  as  repre- 
senting threshold  intensity.  A second  procedure,  known  as  the 
method  of  limits,  is  similar  to  the  method  pf  adjustment  except 
that  (l)  the  exper'imenter,  not  the  observer,  controls  the 
intensity,  and  (2)  the  approaches  from  above  and  below  are 
accomplished  at  a fixed  rate  within  fixed  limits . The  observer 
is  Instructed  to  raise  his  finger  whenever  he  hears  the  tone 
and  to  drop  it  as  soon  as  the  tone  disappears . The  median 
intensity  recorded  for  several  such  determinations  is  taken  as 
representing  threshold  Intensity.  Third,  the  method  of  constant 
stimuli  may  be  utilized.  A number  of  tones  of  different 
intensities  are  presented  to  the  observer,  who  is  asked  to 
respond  "yes”  or  "no”  to  each  tone.  The  responses  are  plotted 
on  a graph  and  the  threshold  intensity  is  usually  taken  as  the 
intensity  at  which  the  subject  gives  "yes"  responses  50^^  of 
the  time . 

The  traditional  clinical  audiometric  technique  is  a 
modification  of  the  method  of  limits,  and  makes  use  of  the 
conventional  audiometer  described  above . The  observer  is 
seated  comfortably  in  such  a position  that  he  can  see  neither 
the  control  panel  of  the  audiometer  nor  the  hands  of  the  operator. 
He  is  Instructed  to  raise  his  finger  or  press  a signal  button 
whenever  he  hears  a tone,  and  to  withdraw  his  finger  or  release 
the  button  whenever  the  tone  disappears . 

The  operator  starts  with  a tone  that  is  well  above 
threshold  and  gradually  reduces  its  intensity  until  the  observer 
Indicates  that  he  no  longer  hears  it . Some  operators  leave  the 
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tone  on  continuously  while  changing  the  Intensity.  Others 
Interrupt  the  tone  either  regularly  or  randomly,  and  adjust 
the  attenuator  (usually  In  steps  of  5 decibels)  only  during 
the  silent  intervals.  Random  Interruption  of  the  tone  is 
believed  by  many  to  be  preferable,  since  in  this  case  the 
subject  does  not  respond  to  a temporal  pattern  and  is  forced 
to  make  Judgments  to  each  stimulus  more  or  less  Independently. 
Once  the  tone  has  become  inaudible,  the  intensity  is  increased 
until  the  observer  again  signals  that  he  hears  the  tone.  The 
mean  of  the  hearing  loss  readings  on  the  audiometer  for  the 
two  directions  of  approach  is  recorded  as  the  correct  hearing 
loss.  The  procedure  is  usually  repeated  several  times,  and 
the  median  of  several  determinations  is  plotted  on  the  audio- 
gram at  the  proper  frequency,  using  a symbol  appropriate  for 
the  ear  being  tested.  The  procedure  is  repeated  for  each  ear 
separately  at  frequencies  of  125,  250,  500,  1000,  2000,  4000 
and  8000  cps,  as  specified  in  the  standards  for  audiometers. 
Formerly,  frequencies  of  128  cps  and  multiples  thereof  were 
used  as  test  stimuli.  However,  it  is  doubtful  if  a frequency 
of  4096  cps  could  really  be  specified  to  more  than  two  signi- 
ficant figures.  Recent  medico-legal  thinking  would  like 
frequencies  of  750,  1500  and  3OOO  cps  to  be  used  in  the  hope 
that  correlation  between  pure-tone  thresholds  and  speech 
audiometry  could  be  Improved  thereby. 

The  above  discussion,  together  with  certain  relevant 
comments  in  Section  16.2,  indicates  that  a standardized  pro- 
cedure for  audlometrlc  testing  is  essential  if  data  on  hearing 
loss  measured  by  different  groups  are  to  be  compared.  For 
example,  if  the  pure-tone  hearing  loss  of  a given  individual 
is  to  be  compared  with  the  average  hearing  loss  for  his  age 
group  (as  obtained  from  one  of  the  national  surveys),  it  would 
be  most  desirable  to  use  the  same  testing  procedure  in  both 
cases.  A standard  should  specify  the  order  in  which  different 
frequencies  are  to  be  presented  to  the  two  ears,  the  intensity 
of  the  tone  at  the  initial  presentation,  and  the  timing  of  the 
stimulus  presentations.  It  should  also  specify  the  response 
of  the  listener  that  defines  the  threshold. 

Speech  Audiometry.  In  the  past,  hearing  loss  for  speech 
was  measured  without  the  benefit  of  a monitoring  meter  or 
other  electronic  Instruments.  A clinician  would  stand,  say 
20'  away  from  a patient,  and  speak  numbers  or  words  for  the 
patient  to  repeat.  The  patient's  hearing  would  be  considered 
satisfactory  if  he  could  repeat  all  the  test  items.  If  he 
could  not,  the  clinician  moved  closer  than  20',  to  a point 
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where  the  patient  could  repeat  the  words  and  numbers 
satisfactorily.  The  distance  then  provided  a measure  of  the 
hearing  acuity  of  the  patient  for  speech.  The  discussion 
that  follows  will  Indicate  that  wide  variability  would  be 
expected  for  tests  conducted  In  such  a manner  because  of  dif- 
ferences In  noise  level.  In  acoustic  characteristics  of  the 
test  room.  In  test  material,  etc.  Reliable  results  can  be 
obtained  only  by  the  proper  use  of  a speech  audiometer. 

In  order  to  perform  acoustic  measurements  of  hearing 
with  speech  as  the  test  material,  we  must  (l)  present  speci- 
fied speech  stimuli  to  the  ear  of  a listener  at  known  Inten- 
sities and  (2)  evaluate  the  response  of  the  listener  to  the 
speech  material . Let  us  first  consider  the  problems  involved 
In  the  generation  of  speech  signals  of  known  intensities . 

The  next  section  (l6.4)  describes  some  of  the  physical 
properties  of  speech,  and  Indicates  that  speech  sounds  have  a 
wide  range  of  Intensities.  There  are  pronounced  fluctuations 
in  the  Instantaneous  speech  Intensity  as  different  speech 
sounds  are  uttered  In  words  or  sentences . Even  the  speech 
Intensity  averaged  over  a syllable  or  word  varies  from  word 
to  word  for  a given  talker,  and  also  varies  markedly  from 
talker  to  talker.  The  Intensity  also  varies  with  distance 
from  the  talker.  Thus,  in  order  to  generate  standard  speech 
material  of  known  and  controlled  Intensity,  we  must  provide  some 
means  for  monitoring  the  Intensity.  This  Is  usually  accomplished 
by  means  of  a meter  that  rectifies  and  averages  the  speech  wave 
over  a fraction  of  a second.  The  Intensity  of  a sequence  of 
test  Items  Is  then  adjusted  to  provide  as  closely  as  possible 
the  same  peak  deflection  of  the  monitoring  meter. 

The  essential  elements  of  a speech  audiometer . are  (l)  a 
source  of  speech,  either  (a)  the  voice  of  the  operator  or  (b) 
a recording,  usually  a phonographic  disc  or  a magnetic  tape, 

(2)  a transducer  appropriate  to  the  source;  i.e.,  a microphone, 
a turntable  and  a phonographic  pickup,  or  a magnetic  tape  play- 
back, (3)  an  amplifier,  (4)  a meter  or  other  device  for  monitor- 
ing the  output  of  the  amplifier  to  a known  or  predetermined 
level,  (5)  an  attenuator,  (6)  a calibrated  earphone  or  earphones. 
Minimal  requirements  for  speech  audiometers  have  been  proposed 
by  the  American  Medical  Association  94/.  These  requirements 
specify  the  acceptable  electrical  characteristics  of  the  com- 
ponents of  the  speech  audiometer.  Including  the  frequency 
response,  distortion  and  noise,  dynamic  range  and  accuracy  of 
calibration. 
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Suppose  we  have  a speech  audiometer  that  satisfies  the 
minimum  requirements,  and  we  are  able,  therefore,  to  present 
to  the  ear  of  a listener  speech  material  of  known  intensity. 

Our  objective  is  to  measure  the  ability  of  the  listener  to 
hear  and  understand  speech.  Two  Important  aspects  of  the 
measurement  must  be  specified  before  we  can  perform  the  test . 
First  the  type  of  test  material  must  be  specified,  and  secondly 
we  have  to  agree  upon  the  method  of  measuring  the  response  of 
the  listener.  These  questions  are  discussed  in  some  detail 
in  Section  l6.4  (in  connection  with  articulation  testing  me- 
thods and  the  effects  of  distortion  upon  Intelligibility), 
and  consequently  only  a few  comments  are  presented  here. 

The  usual  procedure  is  to  present  a sequence  of  test 
items  (which  may  be  sentences,  digits,  spondees,  monosyllabic 
words  or  nonsense  syllables)  to  the  listener,  and  to  ask  him 
to  repeat  or  to  write  down  the  items  he  thinks  he  hears . The 
percentage  of  test  items  recorded  correctly  is  termed  the 
articulation  score . It  is  fairly  obvious  that  as  we  Increase 
the  intensity  of  speech  from  a low  value  toward  a high  level, 
the  articulation  score  will  Increase.  Figure  16.23  19/  shows 
the  relation  between  articulation  score  and  relative  intensity 
for  three  different  types  of  words:  spondees,  unselected 

dissyllables,  and  PB  (phonetically  balanced)  monosyllables  9 /• 
The  abscissas  for  the  three  curves  are  scaled  in  decibels 
relative  to  the  intensity  at  an  articulation  score  of  zero 
percent . We  note  from  the  figure  that  both  the  absolute  value 
of  the  articulation  score  and  the  sensitivity  of  the  score  to 
changes  in  intensity  depend  upon  the  test  material . The 
spondees  give  the  steepest  function,  and  therefore  are 
apparently  the  most  homogeneous  with  respect  to  Intelligibility, 
i.e.,  all  the  spondees  are  almost  equally  intelligible. 

In  clinical  speech  audiometry,  two  types  of  tests  are 
conventionally  used.  These  tests  measure  (l)  the  threshold  of 
intelligibility  or  the  hearing  loss  for  speech,  and  (2)  the 
articulation  score,  or  discrimination  loss.  The  “hearing 
loss  for  speech"  is  the  ratio,  expressed  in  decibels,  of  the 
threshold  for  speech  to  the  normal  threshold  for  speech  for 
the  particular  ear  under  test.  The  threshold  of  Intelligibility 
is  the  intensity  which  yields  an  articulation  score  of  50^.  As 
noted  above,  the  normal  threshoH  depends  upon  the  test  material. 
Spondees  are  frequently  used  in  this  type  of  test  because  the 
articulation  curve  (Plg.  16.23)  is,  as  we  have  seen,  relatively 
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steep.  Spondees  provide,  therefore,  a rather  sensitive 
measure  of  hearing  loss  for  speech*. 

Although  the  hearing  loss  for  speech  usually  provides 
a measure  of  ability  to  hear  speech  at  relatively  low  Inten- 
sities, it  is  not  necessarily  top  strongly  correlated  with 
the  intelligibility  obtained  at  relatively  high  intensities. 

Two  subjects  with  the  same  hearing  loss  for  speech  i.e.,  the 
same  speech  reception  threshold  may  yield  articulation  scores 
which  are  quite  different  when  the  speech  is  presented  at  a 
relatively  high  intensity.  The  articulation  score  for  a 
subject  who  exhibits  a moderate  conductive  hearing  loss  will 
usually  rise  to  about  100^  if  the  speech  intensity  is 
sufficiently  high.  A subject  who  exhibits  a good  deal  of 
hearing  loss  at  high  frequencies  (of  the  type  that  characterizes 
nerve  deafness)  may  not  hear  the  high  frequency  components  of 
speech  at  any  intensity,  and  consequently  his  score  may  approach 
an  asymptotic  value  of  less  than  100%  at  high  speech  intensities. 

It  is  desirable,  therefore,  to  measure  the  "discrimination 
loss"  in  addition  to  the  "hearing  loss  for  speech"  if  a more 
detailed  evaluation  of  an  individual's  hearing  of  speech  is 
required.  By  definition,  the  discrimination  loss  is  the  dif- 
ference, in  percentage  points,  between  the  normal  score  for  an 
articulation  test  and  the  score  for  the  individual  under  testj 
This  form  of  test  is  usually  administered  at  an  acoustic  level 


* The  Central  Institute  for  the  Deaf  in  St.  Louis, 
Missouri  has  prepared  a series  of  word  lists  con- 
sisting of  spondee  words  and  of  PB  words.  The  lists 
have  been  recorded  both  at  constant  level  and  at 
decreasing  level  (to  expedite  threshold  measurement). 
The  tests  consisting  of  spondee  words  are  called 
Auditory  Tests  No.  W-1  and  W-2;  the  PB  lists  com- 
prise Auditory  Test  No.  W-22.  These  tests  are 
available  on  12"  phonograph  records  at  either  33  1/3 
or  78  rpm  from  Technlsonlc  Laboratories,  1201  South 
Brentwood  Boulevard,  St.  Louis,  Missouri.  The 
normal  thresholds  for  these  tests  are;  spondees, 
about  18  to  20  db;  PB's  about  25  db  re  0.0002 
dyne/cm2 . 
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well  above  the  threshold  for  speech,  and  the  test  material 
usually  consists  of  monosyllabic  words.  The  normal  value  of 
discrimination  (or  articulation  score)  for  each  test  must  be 
determined  empirically. 

The  brief  discussion  presented  above  indicates  that  we 
can  devise  relatively  simple  methods  for  the  measurement  of 
hearing  for  speech.  One  or  two  numbers  can  be  used  to  de- 
scribe effectively  the  abilities  of  an  individual  to  hear 
and  reproduce  speech.  These  figures  permit  us  also  to  compare 
the  individual  tested  with  the  "normal"  population.  However, 
these  numbers  mean  little  unless  a complete  description  is 
given  of  the  test  material,  the  method  of  administering  the 
test,  and  the  procedure  for  evaluating  the  response  for  both 
the  normal  population  and  the  Individual  under  test.  It  is 
hoped  that  standards  will  be  set  up  that  will  remove  many  of 
these  uncertainties.  Standard  speech  material  should  be  re- 
corded for  use  with  standard  speech  audiometers . The  normal 
thresholds  and  their  distributions  should  be  determined,  and 
the  recommended  method  for  administering  the  test  should  be 
described  in  detail . Only  if  such  standards  are  established 
can  measures  such  as  the  hearing  loss  for  speech  and  the  dis- 
crimination loss  be  extensively  used  to  evaluate  loss  of 
auditory  function. 

The  Effects  of  Age  Upon  the  Absolute  Threshold  for  Pure 
Tones . Whenever  audlometrlsts  are  Interested  in  assessing  an 
individual's  ability  to  hear,  the  question  of  the  effect  of 
aging  upon  hearing  arises.  Since  the  relation  of  age  and  age- 
related  hearing  loss  occupies  a rather  considerable  place  in 
literature,  it  seems  appropriate  to  discuss  this  matter  in 
some  detail.  As  medico-legal  standards  concerning  the  effects 
of  noise  upon  hearing  are  being  considered  the  problem  of  age 
is  bound  to  gain  in  Importance . 

In  1929  the  first  Important  survey  of  the  effects  of  age 
upon  auditory  sensitivity  was  undertaken.  Up  to  that  time 
there  were  Isolated  observations  that  indicated  that  the  upper 
frequency  limit  of  hearing  decreases  in  old  age.  In  1929 
Bunch  90/  examined  the  hearing  of  more  than  350  patients  at 
Johns  Hopkins  Hospital  who  had  not  given  any  obvious  indication 
of  deafness.  There  were  68  subjects  between  20  and  29  years  of 
age,  70  between  30  and  39j  78  between  40  and  49,  85  between  50 
and  59  and  finally  52  who  were  above  60  years  of  age.  Bunch 
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tested  at  frequencies  between  32  and  16,384*  by  means  of  a 
Western  Electric  Type  lA  audiometer. 

Bunch's  findings  have  been  redrawn  by  Morgan  91/  into 
the  set  of  curves  of  Fig.  l6.21.  These  curves  Illustrate  a 
fact  that  has  been  confirmed  in  all  studies  made  since  I929, 
that  there  Is  a steady  decline  In  the  hearing  acuity  for  the 
higher  frequencies  as  a person  advances  in  age. 

Since  Bunch's  time  there  have  been  several  studies  of 
the  effect  of  age  upon  auditory  acuity.  Most  of  the  data 
have  been  collected  at  World's  or  County  Pairs.  The  study 
of  the  Bell  Telephone  Laboratory  5/  Included  many  thousands 
of  subjects  In  each  age  group**.  Five  frequencies  were  used: 

440,  880,  1760,  3520,  and  7040  cps . The  maximum  hearing  loss 
that  could  be  recorded  for  the  two  highest  frequencies  was 
respectively  46  db  (3520  cps)  and  33  8b  (7040  cps).  The  aver- 
age data  for  the  various  age  groups  are  shown  on  Fig.  16.22. 
While  the  general  trend  of  these  data  Is  not  too  different 
from  those  reported  by  Bunch  or  by  the  U,  S.  Public  Health 
Service  j/,  they  exhibit  an  additional  trend  that  has  since 
been  confirmed:  namely,  that  while  men  In  the  older  age  group 

exhibit  a greater  high  frequency  loss  than  women,  at  low  fre- 
quencies the  hearing  loss  for  men  Is  less  than  that  for  women 
of  the  same  age  group.  While  these  data  are  valuable  they 
still  leave  a certain  number  of  questions  unanswered.  There 
Is,  for  example,  no  good  way  of  estimating  how  representative 
of  the  population  of  this  country  these  subjects  were.  The 
paper  by  Steinberg  et  al  5/  discusses  some  of  the  aspects  with 
respect  to  which  the  sample  was  probably  biased.  The  Pair 
groups  (both  In  New  York  and  San  Francisco)  seemed  to  be  above 
average  In  Intelligence,  amount  of  education  and  economic  status. 
The  sample  also  Included  a larger  than  average  proportion  of 
urban  residents.  There  Is  some  evidence  that  all  these  factors 


We  should  be  rather  cautious  about  accepting  some 
of  these  specifications,  such  as  the  frequency 
specified  for  five  significant  figures  or  the  cali- 
bration of  the  earphone  at  the  higher  frequencies. 

The  data  on  the  subjects ' age  were  based  upon 
estimates  by  attendants. 
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(with  the  possible  exception  of  the  last  one)  tend  to 
present  too  favorable  a picture  of  the  hearing  of  the  general 
population . 

There  are,  however,  other  important  questions  that  can 
be  raised.  Did  the  people  who  had  their  hearing  examined  go 
to  the  test  booths  because  a larger  percentage  than  normal 
among  them  was  aware  of  hearing  Impairment?  Only  about  80^ 
of  the  visitors  who  had  their  hearing  tested  permitted  the 
attendants  to  make  a photographic  record  of  the  results.  Were 
these  80J^  an  unbiased  sample  or  were  people  whose  test  showed 
them  to  have  Impaired  hearing  less  willing  to  have  their  losses 
recorded? 

Many  of  these  questions  and  difficulties  were  either 
answered  or  overcome  at  the  San  Diego  County  Pair  92/.  Here 
hearing  teats  were  administered  to  3666  subjects  at  the  same 
frequencies  that  had  been  used  In  the  Bell  Laboratory's  study. 
Testing  took  place  In  Individual  booths  in  an  ambient  noise 
level  between  35  and  45  db  (as  read  with  network  A of  the 
standard  sound  level  meter).  Each  booth  contained  a headset 
with  donut  cushions . The  softest  test  Item  was  dlscrlminable 
against  the  background  noise  of  the  record.  Careful  Instruc- 
tions and  practice  tests  were  given  the  subjects  before  they 
started  the  part  of  the  test  that  was  actually  scored.  The 
whole  procedure  took  about  eight  minutes  for  each  subject . 

Every  participant's  test  paper  was  retained  together  with  his 
answers  to  questions  concerning  age  group,  sex,  past  exposure 
to  noise,  musical  training  and  a statement  concerning  the  sub- 
ject's awareness  of  hearing  difficulties. 

Table  l6.2  shows  the  mean  and  median  sound  pressure  levels 
at  threshold  for  the  20-29  year  age  groups,  measured  at  San 
Diego.  Also  shown  are  the  monaural  data  of  Sivlan  and  White  ly, 
and  the  data  from  the  World's  Pairs.  The  range  of  sound 
pressure  levels  measurable  on  the  Instrument  at  San  Diego  Is 
also  Indicated.  Since  the  Instrument  has  a limited  range,  there 
are  some  subjects  who  may  either  still  hear  the  softest  tone  or 


Plgure  16.21 

Progressive  hearing  loss  with  Increasing  age,  as 
reported  by  Bunch  90/.  The  audiogram  at  20  years 
of  age  Is  taken  as  a basis  of  comparison  (Prom 
Morgan  91/) . 
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be  unable  to  hear  the  most  Intense  tone  presented.  It  would 
appear  wise,  therefore,  to  use  the  median  values  instead  of 
the  means  If  a measure  of  central  tendency  is  desired.  The 
median  values  have  the  advantage  of  not  reflecting  the  fact 
that  the  stimulus  values  are  truncated  at  both  ends  of  the 
continuum.  The  medians  are  also  insensitive  to  skewness  of 
the  distributions  (See  Sec.  2.2).  The  whole  story,  of  course, 
is  only  told  by  giving  the  complete  distribution  of  thresholds . 


TABLE  16.2 

COMPARISON  OF  THRESHOLD  SOUND  PRESSURE  LEVELS  (IN  DECIBELS  RE 
0.0002  DYNE/CM^)  MEASURED  AT  SAN  DIEGO  AND  WORLD'S  PAIRS  FOR 
THE  20-29  YEAR  GROUP.  THE  SIVIAN-WHITE  DATA  ARE  SHOWN  FOR 

COMPARISON 


San  Diego  20-29  Year 
Group 

Range  of  Instrument 
at  San  Diego 

Slvlan- 

Whlte 

World ' s 

Pairs 

20-29 

Year 

Group 

Frequency 

Mean  SPL 

Median  SPL 

Min.  SPL 

Max . SPL 

MAP  SPL 

Mean  SPL 

cps 

db 

db 

db 

db 

db 

db 

440 

34 

34 

21 

81 

16 

29 

880 

22 

20 

16 

76 

8 

22 

1760 

18 

18 

12 

72 

6 

18 

3520 

21 

18 

11 

71 

8 

20 

7040 

22 

18 

9 

69 

16 

22 

Figure  16.22 

Mean  hearing  loss  in  db  for  men  and  women  in  the 
10-19,  30-39  and  50-59  year  age  groups.  The  data 
were  taken  at  the  World's  Fair  hearing  tests. 
(Prom  Steinberg,  Montgomery  and  Gardner  5/)  • 
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We  are  including  these  details  in  a discussion  of  the 
problem  of  threshold  versus  age  to  Illustrate  three  points: 

(1)  one  encounters  numerous  difficulties  in  obtaining  what 
might  be  termed  a "normal  audiogram,"  l.e.,  a reference  level 
that  one  can  use  in  fion-laboratory  situations;  (2)  it  takes 
studies  as  carefully  prepared  and  monitored  as  the  Pair 
studies  or  the  survey  by  the  U.  S.  Public  Health  Service 
Involving  many  subjects  in  order  to  obtain  the  measure  of 
agreement  that  has  been  found*.  Finally,  (3)  we  wish  to  draw 
the  attention  of  those  who  might  make  audlometrlc  studies  in 
the  future  (even  if  they  Involve  only  comparisons  of  pre- 
employment audiograms  with  after-exposure  audiograms)  to  the 
precautions  that  should  be  taken  and  reported  so  that  others 
can  gain  a picture  of  the  reliability  of  their  studies. 

Let  us  now  compare  the  binaural  data  of  the  San  Diego 
Pair  with  the  monaural  data  of  the  New  York  and  San  Francisco 
Fairs  for  all  age  groups.  Both  sets  of  data,  as  shown  in 
Table  16.3j  exhibit  the  same  trends  with  respect  to  age  except 
that  the  San  Diego  population  has  somewhat  more  acute  hearing. 
This  circumstance  might  be  attributable  to  the  binaural  pre- 
sentation of  the  stimuli.  The  only  really  appreciable  reversal 
of  this  consistent  superiority  trend  for  the  San  Diego  popula- 
tion occurs  in  the  20-29  year  age  group.  We  shall  come  back 
to  this  finding  in  Chapter  17  when  the  effects  of  exposure  to 
noise  upon  hearing  is  discussed. 

Other  findings  of  the  San  Diego  survey  are:  (l)  hearing 

loss  in  db  is  not  normally  distributed  for  the  population  as 
a whole,  for  males  and  females  separately,  or  for  any  given 


* Even  then,  it  is  possible  for  a 5 db  difference 
such  as  the  one  found  at  440  cps  to  occur.  The 
authors  of  the  San  Diego  survey  believe  that  this 
discrepancy  should  be  attributed  to  what  they  term 
"experimental  error . " 


Figure  16.23 

Articulation  score  for  three  different  types  of 
speech  material  plotted  as  a function  of  relative 
intensity.  (From  Hudgins  et  al  95/  and  Hirsh  19/) . 
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ten-year  grouping.  Absolute  hearing  loss,  as  well  as  its 
dispersion  and  skewness.  Increases  with  age  and  frequency, 

(2)  individual  differences  in  hearing,  as  measured  by  the  dif- 
ference in  hearing  loss  between  the  5th  and  95th  percentiles 
of  the  population  Increase  from  15  db  for  people  under  30  at 
880  cps  to  over  50  db  (limit  of  this  test)  for  the  older  age 
groups  and  for  the  higher  frequencies . 

When  all  these  data  are  considered  and  when  some  of  the 
evidence  that  has  a particular  bearing  upon  the  relation  of 
hearing  acuity  to  exposure  to  noise  Is  examined  (see  Sec.  17.4), 
we  ask  ourselves  the  question:  do  we  have  enough  reliable 

information  to  establish  a baseline  for  hearing  acuity  in 
various  age  groups  that  is  reasonably  uncontaminated  by  some 
of  the  factors  whose  effect  we  might  want  to  assess? 

Our  answer  can  be  only  a qualified  yes.  There  is  more 
agreement  than  disagreement  betv/een  these  data,  but  we  are 
still  faced  with  a certain  number  of  uncontrolled  variables. 

We  should  certainly  like  to  know  about  the  individual ' s 
otological  history  and  status,  l.e.,  we  should  like  to  be  able 
to  separate  out  those  whose  hearing  loss  is  attributable  to  an 
identifiable  medical  etiology. 

We  should  also  like  to  know  more  about  the  relation 
between  the  "normal  audiogram"  and  the  threshold  curves  that 
one  can  obtain  in  the  laboratory.  What  factors  account  for  the 
10-15  db  differences*?  Is  the  difference  attributable  to  prac- 
tice, temporary  hearing  loss  due  to  previous  exposure,  masking, 
or  a combination  of  all  three?  We  would  feel  better  if  we  could 
show  that  a normal  population  will  reach  the  Slvlan  and  White 
values  if  the  necessary  precautions  are  taken.  The  present  data 
represent  a starting  point.  We  now  need  to  refine  them.  We 
need  a study  in  depth  (including  repeat  audiograms  to  test  for 
reliability)  that  will  take  into  account  all  the  factors  that 
the  past  has  shown  to  be  Important . 

In  the  meantime  we  shall  have  to  base  our  estimates  of 
che  effects  of  exposure  to  noise  upon  the  difference  between 


In  the  younger  age  groups  tested  at  the  San  Diego 
Fair  about  5/^  of  the  population  were  from  5 to  10  db 
better  than  0 db  hearing  loss . 
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the  population  that  Is  being  studied  and  the  above  mentioned 
"baselines,”  unless  the  study  Includes  Its  own  control  group 
§2/. 


The  preceding  discussion  of  what  has  been  called 
"presbycousls, " that  Is,  the  variation  of  the  thresholds  for 
pure  tones  with  age,  by  no  means  exhausts  the  effects  of  age 
upon  performance  In  response  to  acoustic  stimuli.  However, 
so  little  data  exists  on  other  effects  that  we  are  unable  to 
make  even  qualitative  statements  concerning  most  of  them. 

Recently,  however,  another  aspect  of  man's  behavior  In 
response  to  acoustic  stimuli  has  been  measured  as  a function 
of  age  93/.  Subjects  were  Instructed  to  release  the  Index 
finger  from  a telegraph  key  at  the  sounding  of  a loud  click. 

The  mean  reaction  time  for  the  age  group  between  l8  and  39 
years  was  measured  as  122  milliseconds,  while  the  65-75  year 
age  group  (59  subjects)  was  145  milliseconds.  It  Is,  however, 
worth  stating  that  Individual  differences  as  well  as  varia- 
bility In  a given  Individual  Increases  strikingly  with  age. 

This  fact  Is  dramatically  underlined  by  the  finding  that  32$^ 
of  the  65-75  year  age  group  and  155^  of  the  76-86  year  age 
group  have  faster  reaction  times  than  the  average  for  the 
18-39  year  group. 

Relation  of  Hearing  Loss  to  Measures  of  Auditory 
Performance  Other  Than  the  Threshold.  We  have  Indicated  that 
reduced  auditory  performance  Is  usually  measured  In  terms  of 
a rise  In  the  threshold  for  pure  tones  or  for  speech.  It  must 
not  be  assumed,  however,  that  auditory  functions  other  than 
absolute  thresholds  are  not  also  modified.  For  example.  Just 
noticeable  differences  discussed  In  Section  16.2  have  been 
shown  to  be  affected,  as  has  the  accuracy  with  which  people 
localize  sounds.  Ability  to  discriminate  and  to  localize 
sounds  Is  certainly  of  some  Importance  to  us  In  the  performance 
of  our  tasks  and  In  our  social  life . 

Equal  loudness  contours  and  loudness  functions  for  Indivi- 
duals with  non-conduct Ive  type  hearing  loss  (see  next  section) 
differ  markedly  from  the  corresponding  functions  for  the  normal 
population.  For  such  Individuals,  an  elevation  In  the  absolute 
threshold  for  pure  tones  does  not  mean  that  the  equal  loudness 
contours  are  simply  all  shifted  upwards  by  a fixed  amount.  On 
the  contrary,  at  high  stimulus  Intensities  the  loudness  contours 
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may  be  quite  normal*.  The  shape  of  the  function  relating 
loudness  to  stimulus  intensity  often  provides  information 
concerning  the  type  of  hearing  loss  encountered. 

Types  of  Hearing  Loss  19^102/.  It  has  been  customary 
in  the  past  to  recognize  two  fundamental  kinds  of  hearing  loss 
or  deafness:  "conductive"  hearing  loss  and  "perceptive"  hear- 

ing loss.  The  two  types  may  also  exist  in  combination  to 
provide  a "mixed"  type  of  hearing  loss.  Typical  audiograms 
for  persons  with  conductive,  perceptive,  and  mixed  loss  are 
shown  in  Pig.  16.24. 

Conductive  hearing  loss  reflects  a decreased  sound  trans- 
mission to  the  sensory  endings  in  the  inner  ear.  The  ossicular 
chain,  the  eardrum  and  even  the  ear  canal  (wax)  may  be  involved 
in  this  increased  transmission  loss.  This  type  of  deafness  is 
usually  characterized  by  a pure -tone  hearing  loss  that  is  the 
same  order  of  magnitude  at  all  frequencies  (see  Fig.  16.24). 

The  maximum  amount  of  conductive  hearing  loss  is  usually 
not  more  than  50  or  55  dh.  There  clearly  must  be  a limit  to 
the  transmission  loss  measured  for  air  conducted  sound  since 
energy  can  still  reach  the  inner  ear  by  transmission  through 
the  bones  of  the  head,  even  when  there  is  no  transmission  along 
the  normal  air  conduction  route.  In  fact,  the  audiogram  for 
bone -conducted  sounds  is  usually  about  the  same  for  individuals 
with  conductive  hearing  loss  and  for  individuals  with  normal 
hearing.  Conductive  hearing  loss  affects  not  only  the  thres- 
hold but  also  loudness  functions.  Curve  C in  Pig.  16.25 
illustrates  the  modifications  of  an  idealized  loudness  function 
under  the  hypothesis  of  constant  transmission  loss. 

Conductive  hearing  loss  reduces  the  intelligibility  of 
speech,  if  the  speech  stimulus  is  presented  in  the  quiet  at 
low  intensities . However,  in  many  situations  we  are  required 
to  carry  on  a conversation  in  the  presence  of  a masking  noise, 
and  we  must  raise  our  voices  in  order  to  make  ourselves  under- 
stood. The  level  of  both  the  speech  and  the  noise  transmitted 
to  the  ear  will  be  reduced  for  an  individual  with  conductive 
hearing  loss,  but  the  relative  levels,  i.e.,  the  signal -to- 


Por  a more  detailed  discussion  of  the  phenomenon  of 
"recruitment",  see  Fowler  97/ > de  Brulne-Altes  98/, 
Luscher  and  Zwlslockl  99/,  Hulzlng  100, 101/  and 
Hirsh  19/. 
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noise  ratio,  will  remain  unchanged.  Since  the  speech  level 
is  fairly  high,  the  intelligibility  Is  not  limited  by  the 
threshold  of  hearing,  but  is  limited  by  the  level  of  the  mask- 
ing noise.  Thus  a person  with  conductive  deafness  will  per- 
ceive speech  in  a sufficiently  noisy  environment  about  as  well 
as  a person  with  normal  hearing. 

Certain  authors  have  indicated*  that  there  is  a reduction 
in  intelligibility  at  constant  signal -to-noise  ratio  if  both 
signal  and  noise  are  sufficiently  intense  (about  100  db  SPL) . 

If  these  findings  are  correct,  they  would  point  to  an  advantage 
possessed  by  individuals  with  conductive  deafness.  For  these 
individuals,  the  level  of  both  speech  and  noise  is  reduced, 
but  the  signal -to-noise  ratio  remains  constant.  Consequently 
the  Intelligibility  of  speech  in  hlgh-intenslty  noise  would 
be  improved  in  much  the  same  manner  as  for  individuals  wear- 
ing earplugs . 

Perceptive  hearing  loss  is  often  referred  to  as  nerve 
deafness.  Nerve  deafness  is  presumably  related  to  degeneration 
of  some  of  the  sensory  cells  in  the  inner  ear  or  to  some  similar 
process  in  the  auditory  nerve.  The  hearing  loss  for  pure  tones 
is  usually  different  for  diverse  frequencies.  High-frequency 


♦ See  Figure  A2.2,  Appendix  2. 


Figure  16.24 

Median  absolute  hearing  losses  for  various  types 
of  deafness  groups  measured  at  the  San  Diego  County 
Pair.  The  following  criteria  were  used  in  selecting 
the  groups:  (l)  nerve  deafness  group,  no  greater 

than  6 db  hearing  loss  (HL)  on  440,  no  less  than 
1 db  HL  on  7040  provided  the  loss  on  7040  was  at 
least  24  db  greater  than  that  on  440;  (2)  conductive 
deafness  group,  at  least  7 db  HL  on  440,  any  loss  on 
7040  provided  the  loss  on  7040  is  no  more  than  4 db 
greater  than  the  loss  on  440;  (3)  mixed  deafness 
group,  at  least  7 db  HL  on  440,  at  least  21  db  HL  on 
7040  provided  the  loss  on  7040  exceeds  the  loss  on 
440  by  l4  db  or  more.  These  criteria  are,  of 
course,  arbitrary,  and  could  be  open  to  dispute  by 
some  medical  groups . (After  Webster  et  al  103/) . 
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tones  are  the  ones  most  likely  to  be  affected  as  indicated 
by  the  typical  audiogram  in  Pig.  16.24.  The  audiogram  illus- 
trating nerve  deafness  shows  a relatively  gradual  increase  in 
hearing  loss  with  frequency.  Often,  however,  the  transition 
is  rather  abrupt . Hearing  losses  encountered  in  industrial 
populations  exposed  to  sounds  of  abnormally  high  intensities 
resemble  those  defined  as  nerve  or  perceptive  losses . 

Since  the  hearing  loss  at  high  frequencies  is  often 
quite  large,  an  individual  who  is  nerve  deaf  may  not  hear  the 
high-frequency  components  of  a stimulus  regardless  of  the 
stimulus  intensity.  For  example,  the  high-frequency  sounds 
of  speech  would  be  inaudible,  and  the  understanding  of  speech 
would  be  impaired.  The  listener  must  rely  entirely  on  the 
speech  components  at  lower  frequencies  to  make  his  discrimina- 
tions between  speech  sounds.  The  presence  of  a masking  noise 
in  this  low-frequency  range  would  impair  his  ability  to  make 
these  discriminations,  and  his  understanding  of  speech  would 
therefore  deteriorate  rapidly. 

Reference  has  been  made  above  to  the  loudness  contours 
and  loudness  functions  for  persons  with  non-conductlve  hear- 
ing loss.  Prom  the  audiograms  of  Pig.  16.24  it  is  clear  that 
faint  tones  of  high  frequency  may  not  be  heard  by  the  nerve 
deaf,  whereas  they  are  perfectly  audible  to  the  normal  person. 
Certain  nerve -deaf  individuals  have  been  shown  to  hear  hlgh- 
intenslty,  high-frequency  tones  Just  as  loudly  as  "normals." 


Figure  16.25 

The  figure  on  the  left  shows  theoretical  functions 
relating  loudness  to  intensity  level  for  these  cases: 
(a)  normal  hearing;  (B)  40  db  hearing  loss  of  the 
perceptive  type  with  recruitment;  and  (c)  40  db  hear- 
ing loss  of  the  conductive  type  with  no  recruitment. 
(Prom  Steinberg  and  Gardner  65/>  and  Hirsh  I9/) . The 
three  theoretical  loudness  functions  are  redrawn  at 
the  right  as  they  would  appear  as  results  of  al- 
ternate binaural  loudness  matches.  The  three  curves 
show  the  relation  between  sensation  level  on  the  left 
(normal)  ear  and  sensation  level  on  the  right  (patho- 
logic) ear  for  equal  loudness  at  several  levels.  All 
results  are  for  one  frequency.  (From  Hirsh  19/) . 
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There  is  on  a logarithmic  scale  rather  abrupt  transition 
from  hearing  little  or  nothing  at  all  to  hearing  very  well 
(see  Pig.  16.25).  This  phenomenon,  known  as  "recruitment" 

Is  often  very  annoying  to  Individuals  with  non-conductive 
hearing  loss . They  may  have  difficulty  understanding  a 
person  speaking  In  a low  voice,  but  when  he  raises  his  voice 
to  be  understood,  they  complain  that  he  is  shouting  too  loudly 
at  them.  Recruitment  Is  thus  a feature  that  distinguishes  at 
least  certain  types  of  nerve  deafness. 

16.4  Speech  Communication 


Some  of  the  physical  properties  of  the  speech  sounds 
generated  by  a talker  are  discussed  In  this  section.  In  addi- 
tion we  shall  consider  procedures  for  the  measurement  of  the 
Intelligibility  of  speech.  We  shall  also  discuss  the  use  of 
these  procedures  In  evaluating  the  loss  In  Intelligibility  In 
the  presence  of  various  types  of  masking  and  distortion. 

Acoustic  Properties  of  Speech.  The  average  speech  power 
emitted  by  a speaker  at  a conversational  level  lies  between 
10  and  20  microwatts,  when  the  power  is  averaged  over  a, long 
time  Interval,  The  average  power  level  Is,  therefore,  80  to 
83  db,  using  the  definition  of  power  level  given  In  Chapter  15 
and  in  Sec.  2.1.  When  one  talks  as  loudly  as  possible  the 
power  level  may  rise  to  100  db;  for  whispered  speech  the  power 
level  may  be  as  low  as  40  db. 

The  average  sound  pressure  level  30  cm  directly  in 
front  of  the  speaker's  lips  Is  thus  from  66  to  69  decibels 
at  a conversational  level.  At  low  frequencies,  the  radiation 
of  sound  from  the  lips  is  approximately  non-dlrectional,  but 
at  frequencies  above  1000  cps  the  directional  effects  become 
pronounced.  The  sound  pressure  level  Is  then  greatest  on  the 
axis  directly  In  front  of  the  lips,  and  decreases  to  a minimum 
directly  behind  the  head. 

When  the  speech  power  is  averaged  over  a time  interval 
that  Is  short  compared  with  the  time  interval  taken  by  a 
syllable  (about  0.2  sec),  sizeable  fluctuations  In  the  level 
are  observed  as  different  speech  sounds  are  uttered.  For 
example,  the  power  of  the  strongest  vowel  (o)  Is  about  47 
microwatts,  whereas  the  weakest  consonant  (v)  has  a power  of 
only  0.03  microwatts,  on  the  average  83/.  If  the  pauses 
between  sounds  are  excluded,  the  speech  power  levels  measured 
In  l/8-second  Intervals  lie  within  a range  of  about  30  db  104/. 
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A plot  of  the  cumulative  level  distribution  in  the  l/8-second 
Intervals  is  shown  in  Pig.  l6.26.  On  this  plot  the  ordinate 
indicates  the  percentage  of  time  the  speech  power  is  greater 
than  the  level  defined  by  the  abscissa. 

The  distribution  of  speech  power  with  frequency  (averaged 
over  long  time  Intervals)  has  been  measured  by  several  inves- 
tigators 104, 107j 108/.  The  average  speech  spectrum  for  male 
talkers  is  shown  in  Pig.  16.27,  in  which  the  ordinate  represents 
the  sound  pressure  level  for  bands  one  cps  wide  measured  at  a 
distance  of  one  meter  from  the  talker's  lips.  The  form  of  the 
cumulative  level  distribution  in  Pig.  16.26  has  been  shown  to 
be  valid  for  bands  of  speech  as  well  as  for  the  overall  speech 
signal . 

If  the  speech  energy  in  different  frequency  bands  is 
averaged  over  a time  interval  between  0.003  and  0.02  seconds, 
a more  detailed  picture  of  the  changes  in  the  distribution  of 
speech  energy  as  a function  of  time  is  provided.  The  above 
limits  for  time  intervals  are  short  when  compared  to  the  dura- 
tion of  an  individual  speech  sound.  A three-dimensional 
intensity-frequency- time  representation  is  normally  used  IO9, 
110/  to  display  the  data  that  are  obtained.  Pigure  16.28  shows 
one  such  display  in  which  frequency  is  plotted  vertically  and 
time  horizontally,  while  variations  in  Intensity  are  Indicated 
by  shades  of  gray.  Portions  of  the  Intenslty-frequency-tlme 
pattern  are  characterized  by  bars  or  formants  that  change 
their  frequency  positions  as  different  sounds  are  uttered. 

These  formants  denote  resonances  of  the  vocal  tract  similar 
in  nature  to  the  resonances  of  an  organ  pipe.  As  the  vocal 
tract  assumes  new  positions  in  the  production  of  different 
vowel  sounds  the  resonance  pattern  is  changed.  The  Impulsive 
and  noise-like  quality  of  the  consonant  sounds  is  also  observ- 
able on  the  Intenslty-frequency-time  pattern. 

Articulation  Testing  Methods . In  many  applications  the 
presence  of  unwanted  masking  noise  or  some  other  distorting 
influence  modifies  the  speech  signal  that  reaches  the  ears  of 
a listener  and  Impairs  his  ability  to  interpret  correctly  all 
speech  sounds . A quantitative  measure  of  the  efficiency  of 
speech  communication  must  be  established  in  order  to  evaluate 
the  deleterious  effects  of  the  distortions. 
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The  efficiency  of  a speech  communication  link*  is 
usually  evaluated  by  articulation  testing  methods  96/  although 
other  procedures  have  been  used  occasionally  in  the  past . A 
talker  at  the  input  to  the  communication  link  reads  a sequence 
of  test  items  which  may  be  syllables,  words  or  sentences.  At 
the  output,  a listener  is  supposed  to  record  the  test  items 
to  the  best  of  his  ability.  The  percentage  of  correctly  re- 
corded test  items  is  the  articulation  score. 

For  a given  communication  link  the  articulation  score 
depends  on  the  type  of  test  material  used  111/.  The  scores 
for  digits  and  for  sentences  are  higher  than  the  scores  for 
Isolated  words,  which  in  turn  are  higher  than  the  scores  for 
nonsense  syllables.  Figure  16.23  in  Section  16.3  has  illus- 
trated the  marked  differences  in  articulation  scores  that  are 
obtained  for  selected  spondees,  unselected  dissyllables  and 
monosyllables.  Articulation  scores  for  three  different  types 
of  test  material  are  shown  in  Fig.  16.29.  Scores  for  digits, 
words  in  sentences  and  nonsense  syllables  masked  by  white 
noise  are  plotted  as  a function  of  the  signal -to-nolse  ratio. 

A word  spoken  in  isolation  may  not  be  correctly  inter- 
preted by  a listener  if  some  distortion  or  noise  is  present. 
However,  the  same  word  occurring  in  a sentence  may  be  correctly 
recorded  by  the  listener  since  additional  information  to  assist 


The  speech  communication  link  may  be  simply  an 
air  path  from  the  talker’s  mouth  to  the  listener's 
ear  or  it  may  be  a complex  system  with  amplifiers, 
microphones,  loudspeakers,  or  other  components. 


Figure  16.26 

Cumulative  level  distribution  of  speech  in  one- 
elghth  second  long  Intervals.  The  abscissa 
represents  the  SPL  measured  in  eighth-second 
Intervals  relative  to  the  long  time  average  SPL 
in  db.  The  left-hand  ordinate  gives  the  cumulative 
percent  of  the  time  the  level  is  greater  than  the 
level  indicated  by  the  abscissa.  If  the  pauses  in 
speech  are  excluded,  the  right-hand  ordinate  is 
to  be  used.  (After  French  and  Steinberg  105/,  and 
Beranek  106/) . 
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SOUND  PRESSURE  IN  DB  MINUS  LONG  TIME  AVERAGE  SOUND  PRESSURE  IN  DB 
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in  Identification  of  the  word  is  present  in  the  text. 

Similarly,  if  the  listener  knows  in  advance  that  the  word 
is  drawn  from  a restricted  vocabulary  (one  of  the  ten  digits, 
for  example),  he  is  more  likely  to  identify  the  word  correctly. 
Miller,  Heise  and  Lichten  111/  have  made  a detailed  study  of 
the  effect  of  the  context  and  of  the  size  of  the  test 
vocabulary  on  the  percentage  of  items  recorded  correctly  in 
the  presence  of  a given  noise . 

The  experience  of  the  talker  and  listener  also  Influences 
the  results  and  care  should  be  taken  to  Insure  proper  training 
of  test  personnel.  These  and  many  other  practical  details 
that  must  be  considered  in  conducting  articulation  tests  are 
discussed  in  detail  by  Egan  96/. 

Effects  of  Distortion  of  Speech  on  Intelligibility.  Speech 
waves  may  be  modified  artificially  by  many  types  of  distortion 
113/.  Their  amplitudes  may  be  increased  or  decreased;  the 
speech  may  be  masked  by  random  noise  or  by  other  signals.  The 
speech  may  be  passed  through  frequency-selective  circuits; 
non-linear  distortion  such  as  peak  clipping  may  be  applied  to 
the  speech  wave;  or  various  time  distortions  such  as  flutter 
or  compression  or  expansion  of  the  time  scale  may  be  intro- 
duced. Only  the  first  four  of  the  above  types  of  distortion 
will  be  considered  in  this  chapter.  The  others  are  of  Interest 
only  in  specialized  situations  not  normally  encountered  in 
noise  control . 

The  Intensity  of  Speech.  When  the  talker  and  the  listener 
are  linked  by  a communication  system  that  contains  an  amplifier 
with  a variable  gain  (or  when  the  distance  between  talker  and 
listener  in  a free  field  is  variable)  the  intensity  of  the 
speech  reaching  the  listener  may  be  readily  controlled  and  the 


Figure  16.27 

Average  spectrum  level  of  speech  measured  in  one- 
cycle  bands  for  young  male  voices  talking  at  a 
level  six  decibels  below  the  maximum  they  could 
sustain  without  straining  their  voices.  Micro- 
phone placed  one  meter  in  front  of  talkers  in  an 
anecholc  chamber.  One  decibel  has  been  added  to 
remove  the  effect  of  pauses  between  words  in  the 
total  spectrum  level . (After  Clark  et  al  108/ 
and  Beranek  106/) . 
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Figure  16.28 

An  Intenslty-frequency-tlme  representation  of 
the  speech  sample:  "the  sleepy  time  of  day". 

Frequency  Is  plotted  vertically,  and  time  hori- 
zontally. A concentration  of  energy  in  the 
speech  signal  at  a given  frequency  and  time  Is 
depicted  as  a darkening  at  the  corresponding 
point  In  the  pattern.  The  amount  of  darkening 
depends  upon  the  Intensity.  For  the  vowel 
sounds,  the  resonances  show  up  as  horizontal 
bars  on  the  pattern.  High-frequency  energy 
concentrations  typify  such  consonants  as  [s] 
and  [t]. 
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articulation  score  as  a function  of  intensity  may  be  measured. 
As  the  gain  of  the  amplifier  is  increased  from  zero  the 
listener  at  first  reports  the  presence  of  the  speech  signal . 
The  presence  of  about  half  the  words  can  be  detected  when  the 
sound  pressure  level  of  the  speech  is  about  5 dt  re  0.0002 
dyne/cm2.  The  articulation  score  increases  gradually  from 
zero  as  the  speech  intensity  is  increased  above  the  threshold 
of  detectability  (Fig.  I6.30).  The  rate  of  Increase  depends 
upon  the  type  of  test  material  (see  Pig.  16.23).  Eventually 
the  intensity  is  sufficiently  high  that  almost  all  test  items 
are  recorded  correctly  and  the  score  remains  close  to  100  per 
cent  until  the  threshold  of  pain  is  reached.  At  very  high 
intensities  there  is  sometimes  a drop  in  articulation  score 
with  increasing  intensity. 

The  Masking  of  Speech.  In  a great  number  of  military 
and  industrial  situations  individuals  must  talk  to  each  other 
in  the  presence  of  sounds  that  mask  speech  and  reduce  its 
intelligibility.  Several  distinct  types  of  masking  noise  are 
encountered.  The  masking  noise  may  have  a continuous  spectrum, 
and  may  be  continuous  in  time;  it  may  have  periodic  components 
of  one  or  more  frequencies;  or  it  may  have  an  irregular  or 
impulsive  temporal  characteristic . 

Most  laboratory  investigations  have  used  white  noise 
(see  definition  in  Chapter  15 ) as  a masker.  In  such  experiment 
articulation  scores  for  several  types  of  test  vocabulary  have 
been  plotted  (Pig.  16.29)  as  a function  of  the  speech-to-nolse 
ratio.  As  we  have  pointed  out,  the  articulation  score  is 
higher  both  for  words  in  sentences  (in  which  the  context  fur- 
nishes additional  clues)  and  for  special  test  vocabularies 
containing  fewer  items,  such  as  digits. 

/ 

In  most  situations  encountered  in  practice  the  spectrum 
of  the  masking  noise  differs  from  that  of  white  noise.  The 
intelligibility  of  speech  masked  by  bands  of  noise  has  been 
measured  by  Miller  Il4/.  His  results  are  summarized  in 
Fig.  16.31  for  a fixed  speech  level  of  95  db.  They  show  that 
at  low  noise  levels  high  frequency  bands  of  noise  are  more 
effective  maskers  than  are  bands  below  1000  cps . At  high  noise 
levels,  however,  the  low  frequency  bands  become  more  effective. 
This  conclusion  is  Illustrated  clearly  in  Pig.  16.32,  which 
is  re -plotted  from  the  data  of  Fig.  I6.31. 

In  Miller's  experiments  the  subjects  wore  earphones;  the 
speech  and  masking  noise  were  applied  in  phase  to  each  phone. 

In  recent  years  some  attention  has  been  directed  toward  the 
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Influence  of  Interaural  phase  in  binaural  perception.  Hirsh, 
in  particular,  has  shovm  that  for  pure  tones  in  the  range  up 
to  1000  cps  in  the  presence  of  strong  noise,  detectability  of 
a signal  depends  critically  upon  the  phase  relations  across 
the  head  76,115/.  We  cannot  go  into  the  details  of  these 
studies  here,  but  we  would  like  to  illustrate  the  effect  by 
the  following  example . 

Assume  that  we  present  to  a subject  a 200  cps  tone  over 
earphones  in  such  a manner  that  the  200  cps  sine  waves  are 
in  phase  opposition  at  the  two  ears . Then  we  turn  on  a noise 
whose  overall  SPL  is  near  100  db  and  we  assume  the  noise  to 
be  the  same  at  both  ears . Hirsh  measured  the  masked  threshold 
under  these  circumstances  and  he  found  that  it  was  about  5 db 
better  (l.e.  lower)  than  the  monaural  masked  threshold  for  the 
same  200  cps  tone . He  now  reversed  the  phase  of  the  200  cps 
tone  in  one  ear  so  that  the  sine  waves  were  now  in  phase  at 
both  ears.  The  phase  relation  for  the  noise  was  not  altered. 
In  this  new  situation  the  masked  threshold  was  about  13  db 
worse  than  for  the  previous  binaural  presentation.  In  other 
words,  two  ears  were  now  about  8 db  worse  than  one.  This 
effect  is  a function  of  the  intensity  of  the  masking  noise . 

Llcklider  116/  has  examined  the  effects  of  Interaural 
phase  in  speech  perception.  He  has  shown  that  the  phase 
relations  of  the  speech  and  the  noise  at  the  two  ears  affect 
not  only  the  masked  threshold  but  also  the  articulation  scores 
that  are  obtained  for  a given  speech-to-nolse  ratio.  For 
example,  when  either  the  speech  waves  or  the  noise  waves  at 
the  two  ears  are  18Q0  out  of  phase,  word  articulation  scores 
are  as  much  as  25  percent  higher  than  when  both  the  speech 
waves  and  the  noise  waves  at  the  two  ears  are  in  phase.  When 
two  people  converse  in  a room  in  which  masking  noise  is  pre- 
sent, the  fact  that  they  are  able  to  localize  the  source  of 
the  speech  by  orienting  their  heads  helps  them.  If  the  noise 
level  is  rather  high  they  tend  to  turn  one  ear  toward  the 
source,  thereby  taking  advantage  of  the  decreased  distance 


Figure  16.29 

Articulation  scores  for  three  different  types 
of  test  material . The  test  items  were  masked 
by  white  noise,  and  the  percent  items  correct 
are  plotted  as  a function  of  signal-to-nolse 
ratio  in  decibels . (From  Miller,  Heise  and 
Llchten  111/) . 
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from  the  source  to  the  ear  and  the  Interaural  phase 
phenomenon  at  the  same  time.  The  most  advantageous  applica- 
tion of  the  interaural  phase  relation  can  be  made  for  situa- 
tions in  which  the  ambient  noise  is  high  while  the  speech 
signal  comes  over  a pair  of  earphones. 

The  threshold  of  perceptibility  has  been  defined  as 
the  faintest  level  of  speech  at  which  the  gist  of  connected 
discourse  can  be  followed  by  trained  listeners  exerting 
continued  effort.  Stevens,  Miller  and  Truscott  117/  have 
investigated  the  effects  of  pure  and  complex  tones  upon  the 
threshold  of  perceptibility  of  speech.  Figure  16.33  shows 
how  sine  waves,  square  waves  and  pulses  mask  speech.  In 
these  curves  117/  the  masking  is  the  difference  in  decibels 
between  the  threshold  of  perceptibility  of  the  speech  in  the 
presence  and  in  the  absence  of  the  masking  signal . The  rela- 
tive effectiveness  of  the  three  types  of  masking  signals  may 
be  compared  because  the  relative  levels  of  the  masking  signals 
are  Indicated  in  terms  of  the  peak  voltage  Impressed  on  the 
earphones . Sine  waves  of  low  frequencies  mask  speech  more 
effectively  than  high  frequency  tones.  Tones  with  frequencies 
between  300  and  500  cps  are  the  most  effective  masking  stimuli. 
Square  waves  and  pulses  mask  speech  somewhat  more  effectively 
than  sine  waves  since  they  contain  frequency  components  that 
extend  over  a wider  range  of  frequencies.  For  frequencies 
above  1000  cps,  however,  sine  waves,  square  waves  and  pulses 
are  about  equally  ineffective  in  the  masking  of  speech.  In 
a subsequent  section,  we  shall  see  some  of  the  reasons  why 
this  is  so. 

The  masking  effectiveness  of  a noise  is  markedly  dependent 
upon  its  temporal  continuity.  In  general,  the  masking  effec- 
tiveness is  decreased  if  the  masking  noise  is  turned  off 
periodically,  but  the  magnitude  of  this  decrease  is  dependent 
upon  the  fraction  of  time  the  noise  is  on  and  the  number  of 
Interruptions  per  second.  Results  obtained  by  Miller  and 
Llckllder  118/  are  summarized  in  Fig.  l6.3^.  At  slow  rates 
of  interruption  between  1 and  100  per  second,  it  is  possible 
to  patch  together  the  bits  of  speech  heard  between  the  bursts 
of  noise  and,  therefore,  the  masking  effectiveness  of  such 


Figure  16.30 

Articulation  score  for  monosyllabic  words  as 
a function  of  the  SPL  of  the  speech.  (From 
Miller  113/) . 
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noises  Is  rather  low  when  compared  with  continuous  noise. 

For  Interruption  rates  above  200  per  second,  the  masking  Is 
effectively  equivalent  to  that  of  continuous  noise. 

Frequency  Selectivity.  A different  type  of  distortion 
occurs  If  speech  Is  passed  through  a system  that  Is  selective 
with  respect  to  frequency.  If  we  remove  portions  of  the  speech 
spectrum  by  means  of  filters  we  can  perform  articulation  tests 
on  the  remaining  portions  of  the  signal  and  evaluate  the  rela- 
tive Importance  of  different  frequency  bands  to  Intelligibility 
French  and  Steinberg  105/  have  reported  the  results  of  artlcula 
tlon  tests  for  both  male  and  female  speech  signals  passed 
through  various  high  and  low  pass  filters.  Some  of  their  data 
on  syllable  articulation  are  summarized  In  Fig.  16.35. 

The  data  for  low  pass  filters  Indicate  that  there  Is  only 
a small  Increase  In  articulation  score  for  cut-off  frequencies 
above  5000  cps . A low  pass  filter  with  a cut-off  frequency 
of  300  cps  provides  essentially  an  articulation  score  of  zero. 
Likewise,  a filter  passing  all  frequencies  above  300  cps  gives 
an  articulation  score  close  to  100  percent. 

If  we  use  the  relation  between  syllable  articulation  and 
sentence  articulation  (see  Fig.  16.29  for  example)  we  see  that 
a speech  communication  system  passing  either  all  frequencies 
below  or  above  2000  cps  will.  In  the  quiet,  yield  close  to 
100  percent  sentence  Intelligibility.  The  speech  will  not 
sound  natural  but  It  will  be  Intelligible,  and  In  certain  cir- 
cumstances this  may  be  all  that  Is  desired.  Such  a speech 
signal,  having  lost- much  of  Its  redundancy,  will  prove  more 
vulnerable  to  noise  than  a normal  broad  band  signal . 

The  Intelligibility  of  filtered  speech  masked  by  noise 
has  been  measured  by  Pollack  119/.  Likewise,  Egan  and  Wiener 
120/  have  studied  the  Intelligibility  of  narrow  bands  of 
speech  In  the  presence  of  noise.  Pollack's  data  show  that 
the  relative  contribution  of  the  various  speech  frequencies 
to  Intelligibility  In  the  presence  of  noise  Is  not  constant. 


Figure  16.31 

The  articulation  score  for  monosyllabic  words 
as  a function  of  the  SPL  of  the  masking  noise . 
The  different  frequency  bands  of  noise  are  para- 
meters . The  level  of  the  speech  was  held  con- 
stant at  95  db.  (Prom  Miller  114/) . 
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Rather  the  relative  contribution  Is  a function  of  the  Inten- 
sity of  the  speech  signal  relative  to  the  constant  white  noise 
masking  signal.  As  the  SPL  of  the  speech  signal  Is  increased, 
the  relative  contribution  to  Intelligibility  of  the  higher 
speech  frequencies  Increases. 

Amplitude  Distortion.  Amplitude  distortion  Is  commonly 
encountered  in  speech  communication  links  that  include  non- 
linear components  such  as  vacuum  tubes  and  carbon  microphones. 
When  communication  Is  via  air  path  only,  appreciable  amplitude 
distortion  Is  not  encountered,  but  a study  of  the  effects  of 
amplitude  distortion  upon  the  Intelligibility  of  speech  con- 
tributes to  our  understanding  of  the  process  of  speech 
perception. 

Suppose  the  speech  wave  Is  passed  through  a device  that 
limits  the  amplitude  of  the  wave.  Thus  when  the  amplitude  of 
the  speech  wave  at  the  input  Is  equal  to  or  greater  than  a 
fixed  amplitude,  the  output  wave  is  limited  In  amplitude.  The 
number  of  decibels  by  which  the  amplitudes  of  the  peaks  are 
reduced  by  limiting  defines  the  amount  of  peak  clipping.  In- 
finite peak  clipping  would  reduce  the  speech  wave  to  a succes- 
sion of  rectangular  waves.  The  output  would  switch  polarity 
every  time  the  speedh  wave  crosses  the  time  axis.  Llckllder 
121 , 122/  has  measured  the  percent  word  articulation  for  speech 
that  has  been  subjected  to  various  amounts  of  peak  clipping. 

In  his  experiments  he  maintained  a fixed  peak-to-peak  amplitude 
of  the  speech  at  the  earphones.  That  is,  the  speech  wave  re- 
ampllfled  after  clipping  In  order  to  maintain  the  same  peak- 
to-peak  amplitude  that  existed  before  clipping.  Llckllder 's 
results,  shown  in  Pig.  16.36,  Indicate  that  a surprisingly 
large  fraction  of  the  speech  wave  can  be  eliminated  before 
Intelligibility  Is  greatly  affected.  Even  when  the  speech  is 
reduced  to  a succession  of  rectangular  waves,  about  JO  percent 
of  the  words  can  still  be  understood. 


Figure  16.32 

Smoothed  curves  drawn  from  the  data  of  Pig.  l6.3l- 
The  articulation  score  is  shown  as  a function  of 
the  component  frequencies  of  the  masking  noise. 

The  parameter  Is  the  SPL  6f  the  masking  noise . 
(From  Miller  ll4/) . 
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Pollack  124/  has  measured  the  effect  of  frequency  and 
amplitude  distortion  on  the  Intelligibility  of  speech  in 
noise.  His  data  show  that  at  low  signal -to-nolse  ratios, 
the  intelligibility  of  the  clipped  speech  signal  is  con- 
siderably higher  than  that  of  the  undipped  signal . 

Articulation  Index.  It  is  often  desirable  to  predict 
the  Intelligibility  of  speech  passed  through  a speech  trans- 
mission system  without  resorting  to  laborious  articulation 
tests . On  the  basis  of  the  results  of  laboratory  articula- 
tion tests  such  as  those  described  above,  empirical  relations 
between  the  articulation  scores  and  the  pertinent  system 
parameters  (such  as  masking  noise,  or  frequency  characteristics) 
have  been  derived. 

Since  the  articulation  score  depends  on  the  type  of  test 
material  used,  it  is  necessary  to  define  a more  fundamental 
measure  of  articulation  efficiency.  The  definition  of  a new 
quantity,  called  the  articulation  index,  will  be  illustrated 
in  the  following  hypothetical  experiment.  Let  us  divide  the 
audible  frequency  range  into  a number  of  contiguous  frequency 
bands  (say  lO),  with  the  width  of  each  band  adjusted  so  that 
the  articulation  score  of  speech  when  passed  through  each  band 
at  a normal  intensity  level  is  the  same . The  articulation 
index  for  each  band  is  taken  as  10,  and  the  total  articulation 
index  for  all  bands  is  100.  Each  band  of  speech  is,  therefore, 
assumed  to  contribute  Independently  to  the  index  so  that  the 
articulation  index  for  several  bands  together  is  simply  the 
sum  of  the  indices  of  the  Individual  bands . A relation  of  this 
type  does  not  hold  for  the  articulation  score.  For  example, 
the  word  articulation  score  for  the  case  of  nine  of  the  ten 
bands  is  about  98  percent,  although  the  articulation  index  is 
90.  Experimentally  determined  relations  between  the 


Figure  16.33 

The  masking  of  speech  by  sine  waves,  square  waves 
and  pulses.  The  ordinate  shows  the  change  in  the 
threshold  of  perceptibility,  or  the  masking.  The 
peak  voltage  applied  to  the  earphones  for  each 
masking  signal  was  held  constant  at  one  volt  for 
the  curves  labeled  0 db.  For  a sine  wave,  one 
volt  peak  to  peak  corresponds  to  a SPL  of  112  db. 

All  measurements  were  made  blnaurally  (From  Stevens, 
Miller  and  Truscott  II7/) 
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articulation  scores  for  several  types  of  test  material  and 
the  articulation  index  are  shown  In  Pig.  16.37  105/. 

When  filtered  speech  Is  masked  by  noise  the  articulation 
Index  is  decreased  to  an  extent  depending  on  the  Intensity 
level  and  spectrum  of  the  masking  noise . For  masking  noises 
that  have  a reasonably  uniform  and  continuous  spectrum  level, 
the  articulation  Index  may  be  evaluated  by  procedures  de- 
veloped by  French  and  Steinberg  105/,  and  Beranek  106/ . 

Let  us  focus  our  attention  on  one  of  the  10  bands  of 
equal  contribution  to  the  articulation  index.  The  Intensity 
levels  of  the  speech  in  this  band  vary  during  continuous  speech 
over  a range  of  about  30  decibels,  and  the  cumulative  distri- 
bution of  levels  measured  on  a decibel  scale  is  approximately 
linear,  as  shown  in  Pig.  16.26.  The  noise  in  this  frequency 
band  will  mask  the  speech  by  an  amount  depending  on  the  level 
of  the  noise  relative  to  the  30  decibel  range.  The  assumption 
made  by  French  and  Steinberg  and  by  Beranek,  and  supported  by 
experimental  articulation  data,  is  summarized  as  follows:  the 

contribution  of  the  band  to  the  overall  articulation  index  is 
equal  to  the  articulation  index  for  the  unmasked  band  multi- 
plied by  the  fraction  of  the  30  decibel  range  that  is  not 
covered  by  the  noise.  Both  speech  and  noise  levels  are  mea- 
sured in  terms  of  decibels  per  cycle. 

Beranek  has  summarized  the  above  concepts  of  articulation 
index  in  the  form  of  a chart,  shown  in  Pig.  I6.38.  The  normal, 
undlstorted  speech  spectrum  and  the  range  of  speech  levels  are 
represented  by  the  shaded  speech  area.  Adjustment  of  the  gain 
or  the  frequency  response  of  the  speech  communication  link 
moves  the  speech  levels  (as  a function  of  frequency)  up  or  down 
on  the  decibel  scale.  The  frequency  scale  is  so  selected  that 
equal  distances  along  this  scale  provide  equal  contributions 
to  articulation  index.  If  the  spectrum  level  of  a masking 
noise  is  plotted  on  the  same  chart,  the  portion  of  the  speech 


Figure  16.34 

Articulation  scores  for  monosyllabic  words  were 
obtained  in  the  presence  of  masking  noise  inter- 
rupted at  various  rates . The  noise  was  on  half 
the  time,  off  half  the  time.  The  parameter  is 
the  signal -to-nolse  ratio  in  decibels.  (Prom 
Miller  and  Llckllder  II8/) 
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area  that  remains  unmasked  by  the  noise  spectrum  represents 
the  articulation  index  for  the  system  in  the  presence  of  the 
given  noise. 

It  is  to  be  emphasized  that  Beranek's  chart  is  applicable 
only  to  cases  in  which  the  amount  of  distortion  of  the  speech 
along  the  frequency  dimension  does  not  change  abruptly  and  in 
which  the  spectrum  of  the  masking  noise  is  reasonably  uniform. 
For  example,  the  chart  can  certainly  not  be  uped  to  predict 
the  Intelligibility  of  speech  masked  by  a pure  or  complex  tone. 
To  a first  approximation  we  may  replace  a single -frequency 
masker  (that  is  well  separated  from  other  components  of  the 
masking  signal)  by  a band  of  noise  having  the  same  sound  pres- 
sure level  and  whose  bandwidth  is  equal  to  the  critical  band. 

A more  general  but  also  more  complex  procedure  for  the  computa- 
tion of  articulation  index  has  been  presented  by  Fletcher  and 
Galt  123/.  This  procedure  presumably  applies  to  all  types  of 
distortion  and  to  different  masking  spectra. 

Speech  Interference  Level . The  computation  of  the  articu- 
lation index  is  frequently  rather  time-consuming,  and  such  de- 
tailed calculations  are  often  not  warranted  in  engineering 
practice.  Also,  measurements  of  the  spectrum  of  the  masking 
noise  are  not  made  in  sufficient  detail  to  define  at  each 
frequency  the  spectrum  level  of  the  noise.  Usually  the  sound 
pressure  level  is  measured  in  octave  bands  of  frequency,  four 
of  which  are  shown  by  the  vertical  lines  in  Fig.  l6.39. 

About  80  percent  of  the  important  range  of  speech 
frequencies  is  covered  by  the  three  octave  bands  600-1200, 
1200-2400  and  2400-4800  cps.  This  is  shown  in  Fig.  16.38  in 
which  the  frequency  scale  is  distorted  in  such  a fashion  that 
equal  distances  along  the  scale  correspond  to  equal  contribu- 
tions to  articulation  index.  Specification  of  the  levels  of 
continuous  spectrum  masking  noise  in  these  three  octaves  pro- 
vides data  that  permit  computation  of  the  articulation  index 
to  a reasonable  degree  of  approximation  126/. 


Figure  16.35 

Syllable  articulation  for  speech  passed  through 
high  pass  and  low  pass  filters  in  the  quiet . The 
gain  is  adjusted  to  a value  that  gives  maximum 
articulation  score  (From  French  and  Steinberg  103/)  • 
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The  speech  Interference  level  has  been  defined  as  the 
arithmetic  average  of  the  sound  pressure  levels  (in  decibels) 
In  the  three  octave  bands  600-1200,  1200-2400  and  2400-4800 
cps . It  Is  a single  number  that  indicates  the  Interfering 
effect  of  noise  on  speech. 

A masking  noise  following  the  contour  of  the  level  of 
the  speech  minima  In  Fig.  I6.38  would  have  a speech  inter- 
ference level  of  about  43  decibels.  For  a speech  level  of 
69  db  at  one  meter,  which  corresponds  to  a raised  voice,  a 
masking  noise  with  a speech  Interference  level  of  43  db  would 
have  a negligible  effect  on  Intelligibility.  As  will  be  shown 
in  Section  I8.3  reliable  conversation  can  be  carried  on  at  a 
much  higher  speech  Interference  level. 

A more’  detailed  discussion  of  the  speech  interference 
level.  Its  applications  and  Its  limitations  are  presented  in 
Section  l8.3^  which  deals  with  speech  communication  criteria 
for  noise  control . 


Figure  I6.36 

The  effects  of  peak  clipping  upon  the  Intelligi- 
bility of  monosyllabic  words.  When  peak  clipping 
is  infinite,  the  articulation  score  is  approxi- 
mately 70  percent . (After  Llckllder  and  Miller  123/) . 
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Figure  16.37 

Approximate  relations  between  articulation  index 
and  articulation  score  for  several  types  of  test 
material . (Prom  French  and  Steinberg  105/) . 
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Figure  I6.38 

Chart  for  computing  the  articulation  index  for 
speech  subjected  to  frequency  distortion  and 
masked  by  continuous  spectrum  noise . If  the 
level  of  the  speech  differs  from  the . level  shown 
on  the  chart  (69  db),  the  band  representing  the 
speech  area  is  moved  up  or  down  accordingly. 

The  shape  of  the  spectrum  is  likewise  distorted 
to  account  for  any  frequency  distortion  in  the 
communication  system.  The  spectrum  level  of 
the  masking  noise  is  plotted  on  the  graph.  The 
fraction  of  the  total  speech  area  that  projects 
above  the  masking  noise  is  equal  to  the  articu- 
lation index  (Prom  Beranek  106/) . 
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CHAPTER  17 


EFFECTS  OF  NOISE  ON  HUMAN  BEHAVIOR 
17.1  Introduction 


"Does  noise  affect  human  behavldr?”  Almost  universally, 
this  trivial  question  will  be  answered  by  a firm  "yes”. 

Suppose,  however,  we  probe  deeper  and  ask  two  more  specific 
questions:  "How  does  noise  affect  human  behavior?"  and  "What 

noise  affects  which  behavior?"  There  is  less  agreement  on 
the  answers  to  these  two  questions,  and  in  many  cases  no  de- 
finite answers  have  been  proposed. 

Opinions  expressed  in  the  literature  are  divided  roughly 
into  two  camps . (See  Kryter  1,2/,  for  example ) . Certain 
authors  try  to  show  that  noise,  like  the  weather,  controls 
practically  all  of  man's  activities,  and  that  it  Influences 
his  efficiency  and  his  creativity.  Some  even  proclaim  that 
noise  is  a determining  factor  in  man's  physical  and  psycho- 
logical well-being,  to  a point  where  noise  is  blamed  for  the 
declining  birth  rate  and  for  the  Increase  in  the  number  of 
admissions  to  mental  hospitals 

On  the  other  hand  there  is  a group  that  is  smaller  in 
size  but  not  less  firmly  rooted  in  its  convictions.  These 
authors  admit  that  noise  Interferes  with  auditory  communication, 
and  that  it  may  even  - through  the  fact  that  in  many  activities 
people  have  to  talk  to  each  other  or  would  like  to  be  able  to 
talk  to  each  other  - Interfere  with  certain  communlcatlon- 
dependent  human  functions.  Concerning  most  other  claims,  how- 
ever, this  group  points  to  the  negative  results  of  controlled 
laboratory  experiments.  Most  of  the  laboratory  data  collected 
to  this  day  indicate  that  many  of  man's  activities  are  rela- 
tively unaffected  by  noise. 

Before  attempting  to  review  at  least  part  of  the  evidence 
upon  which  the  two  opposite  camps  base  their  arguments,  we 
should  see  if  we  can  agree  on  what  we  are  arguing  about . What 
do  we  mean  when  w©  talk  about  "noise"  and  about  "behavior"? 
Suppose  we  ask  a communication  engineer  whether  a particular 
communication  link  is  able  to  transmit  signals  in  the  presence 
of  noise . In  return,  the  communication  engineer  would  want 

to  know  what  signals  are  to  be  transmitted  and  what  kind  of 
noise  is  present.  Our  problem  is  somewhat  analogous.  In  order 
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to  predict  the  effects  of  noise  upon  human  behavior,  we  must 
specify  both  the  noise  and  the  behavior  (l.e.,  the  "signal”). 

The  accuracy  of  our  predictions  will  depend  upon  the  precision 
of  our  measurements  of  both  the  noise  and  the  behavior. 

Various  definitions  could  be  given  for  the  two  terms  6/. 

The  Acoustical  Terminology  of  the  American  Standards  Associa- 
tion defines  noise  as  "any  undeslred  sound".  Thus  our  Instru- 
ments must  measure  the  characteristics  of  an  acoustic  event 
that  people  have  designated  as  undesirable,  or  an  acoustic 
event  that  we  have  good  reason  to  assume  produces  deleterious 
effects . 

The  humain  behavior  In  which  we  are  Interested  expresses 
Itself  by  all  kinds  of  physiological  and  psychological  responses. 
Preferably  the  responses  should  be  sufficiently  obvious  that 
they  can  be  observed  by  others,  but  we  cannot  usually  be  too 
restrictive  In  the  way  In  which  we  measure  behavior.  We  shall 
accept  data  from  opinion  polls,  complaints  to  newspapers, 
management  or  even  neighbors  as  expressions  of  behavior,  pro- 
vided they  can  be  observed  with  adequate  reliability. 

In  the  following  sections  we  shall  review  some  of  the 
reported  evidence  that  relates  noise  and  behavior.  We  shall 
not  repeat  here  the  data  on  masking  (Sec.  l6.2  and  l6.4).  In 
which  noise  (which  In  some  cases  could  even  be  a pure  tone  or 
a piece  of  popular  music)  Interferes  with  the  detection  of 
acoustic  signals.  Since  masking  studies  are  usually  Integrated 
with  the  basic  data  on  hearing,  we  discussed  them  In  Chapter  l6. 

We  do  not  claim  to  have  treated  the  subject  of  masking 
exhaustively.  For  example  we  have  not  studied  cases  of  mask- 
ing that  Involve  hlgh-lntenslty,  low-frequency  sounds . Fur- 
thermore, we  have  not  dealt  with  such  problems  as  the  possible 
Interaction  of  vibratory  phenomena  with  the  detection  of 
acoustic  signals.  In  both  Instances,  lack  of  adequate  data 
explains  our  reluctance  to  speculate. 

Our  treatment  In  this  chapter  will  not  attempt  to  review 
the  literature  In  detail,  since  the  literature  Is  too  massive 
and  also  too  complex  for  such  a task.  We  shall,  however, 
attempt  to  Indicate  the  considerations  that  are  Important  for 
a critical  understanding  of  the  work  that  has  been  reported, 
and  we  shall  also  try  to  point  out  the  problems  that  exist  In 
the  various  areas . 
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17.2  Effects  upon  Bodily  Functions  Other  than  Hearing 


The  effects  to  be  considered  here  fall  Into  two  general 
categories:  (a)  Intersensory  effects,  l.e,,  situations  In 

which  the  presence  of  a noise  stimulus  affects  psychophysical 
responses  to  other  sensory  stimuli,  and  (b)  effects  of  noise 
upon  general  bodily  functions  such  as  heart  rate,  metabolism, 
or  galvanic  skin  response . 

Intersensory  effects  at  normal  levels  of  stimulation 
have  been  of  considerable  Interest  to  psychologists  for  many 
years . Apparently  there  are  no  really  safe  generalizations 
from  present  data.  Hence  we  shall  not  attempt  to  discuss  whe- 
ther an  acoustic  stimulus  enhances  or  Inhibits  detection  of, 
say,  a visual  signal . Our  chief  concern  In  this  handbook  Is 
with  those  Intersensory  effects  that  are  observable  In  noise 
of  high  Intensity. 

It  Is  relatively  easy  to  see  how  general  bodily  functions 
might  be  affected  by  the  presence  of  an  acoustic  signal  If  such 
a signal  connotes  danger  (sound  of  approaching  bomber  or  missile) 
or  If  It  Is  associated  with  a pleasurable  event.  It  Is  also 
clear  that  a person  who  suffers  from  Insomnia  may  develop 
various  kinds  of  symptoms  If  the  sound  of  an  approaching  air- 
plane or  truck  Intrudes  Just  when  he  Is  about  to  doze  off. 

More  dramatically,  a person  may  be  frightened  by  an  unexpected 
noise  Just  as  he  might  be  startled  If  he  were  suddenly  drenched 
with  Ice-cold  water.  It  Is  clear  that  we  cannot  pretend  to 
establish  acoustic  design  criteria  that  takes  Into  account  the 
unexpectedness  or  the  connotation  of  such  noises.  Here  again 
we  must  restrict  our  survey  to  those  changes  In  bodily  responses 
that  can  be  observed  reliably  (though  not  without  variability 
between  Individuals).  This  again  amounts  to  a restriction  of 
our  Interest  to  the  effects  of  high- Intensity  noise . 

For  the  purposes  of  this  section,  we  shall  refer  to  noises 
In  and  above  the  zone  of  tolerance  thresholds  as  high- Intensity 
noises . The  overall  sound  pressure  level  Is  approximately  120 
db  and  above . 

We  should  remember  that  In  environments  where  the  SPL  Is 
so  high,  man  often  finds  himself  In  the  presence  of  many  other 
kinds  of  stimuli  that  may  be  emitted  by  high-energy  sources 
(for  example,  the  heat  and  fumes  given  off  by  a Jet  engine). 

Man  may  often,  especially  In  military  situations,  find  himself 
In  a position  of  considerable  stress  In  which  noise  Is  only 
one  of  several  stressful  stimuli.  In  such  circumstances  It 
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would  be  Inadvisable  to  attribute  changes  In  man's  bodily 
performance  to  the  noise  alone  If  careful  control  of  the  other 
variables  is  not  guaranteed. 

With  the  development  of  Jet  aviation,  considerable 
attention  has  been  focused  on  the  effects  of  ultrasonics . 
Following  World  War  II,  many  wild  reports  attributed  extra- 
ordinary powers  to  sound  waves  at  Inaudible  frequencies . To- 
day there  Is  agreement  that  most  of  the  effects  that  have  been 
reliably  observed  7/  were  a function  of  the  energy  In  the 
acoustic  stimulus  and  not  of  the  frequency. 

The  remarks  on  the  effects  of  ultrasonics  should  be 
tempered  by  the  realization  that  the  short  wavelengths  permit 
focusing  of  the  energy  in  the  ultrasonic  beam  to  an  extent 
that  is  clearly  Impossible  In  the  audio  range . Parrack  8/ 
has  pointed  out.  Incidentally,  that  turbo-jet  engines  produce 
relatively  little  energy  In  the ’ ultrasonic  range. 

Some  of  the  above  observations  are  contradicted  by  the 
findings  of  Grognot  ^ . Describing  certain  changes  In  the 
blood  circulation  of  his  subjects  after  exposure  to  airborne 
sounds  of  frequency  25  kcps,  Grognot  states  that  the  effects 
were  lacking  for  subjects  who  were  deaf  or  who  wore  ear  pro- 
tectors . Confirmation  by  other  experiments  will  be  needed 
before  these  observations  are  generally  accepted. 

Recently  certain  European  authors  have  reported  effects 
on  the  hearing  of  small  animals  after  exposing  the  animals' 
ears  to  large  doses  of  ultrasonics.  There  Is,  however,  little 
reason  to  conclude  that  ultrasonics  have  a particular  effect 
upon  the  auditory  system.  Since  ultrasonics  Interact  In 
general  with  biological  materials,  there  Is  no  reason  why  they 
should  not  also  affect  animals'  ears. 

In  a paper  entitled  ”Physiological  and  Psychological 
Effects  of  Noise,”  Parrack  10/  summarizes  the  findings  of  the 
Aero  Medical  Laboratory  at  Wright -Patterson  Air  Force  Base  In 
the  following  manner; 

"Action  of  Sound  on  Other*  Body  Mechanisms 

"Receptors  In  the  Body  .Surface ; Low  frequency  sound 
waves  stimulate  receptors  In  the  skin.  A sound  at  a level  of 


i.e.,  other  than  the  mechanism  of  hearing 
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110  db  can  be  felt  by  the  hand  when  the  frequency  Is  around 
100  cps . Under  certain  conditions  these  sensations  from  body 
receptors  can  be  sufficiently  strong  to  give  the  impression 
that  the  earth  or  the  building  that  supports  us  is  vibrating 
violently. 

"Receptors  of  the  Joints  and  Tendons:  Low  frequency 

sound  waves  may  vibrate  the  muscles  of  our  arms  or  legs  and 
also  the  thoracic  and  abdominal  walls . Receptors  located  in 
Joints  or  In  tendons  are  stimulated.  This  may  lead  to  reduced 
ability  to  perform  critical  manual  acts . Sensations  from 
these  receptors  acting  in  combination  with  those  from  the  body 
surface  receptors  cause  us  to  rate  the  sound  field  as  very 
unpleasant  or  extremely  annoying. 

"Heat  Receptors:  In  very  Intense  sounds  (ca  150  db)  at 

frequencies  between  2000  and  25,000  cps  there  may  be  a mild 
warming  of  the  body  surface.  If  there  are  narrow  channels 
between  areas  of  skin,  as  when  the  hand  is  held  In  the  sound 
field  with  the  fingers  loosely  approximated,  the  heating  action 
may  be  sufficiently  great  to  cause  actual  pain.  Heating  is 
also  pronounced  under  the  edge  of  one's  collar  or  at  the  cuffs 
of  a coat  or  shirt.  Certain  Items  of  clothing  may  absorb  suf- 
ficient sound  energy  to  become  quite  warm  (a  temperature  rise 
from  32°C  to  59°C  In  1 min.)  and  thus  secondarily  stimulate 
receptors  In  the  skin.  Sound  fields  of  such  Intensities  are 
rated  very  unpleasant  and  are  obviously  avoided  when  possible . 

"Absorption  of  Sound  at  the  Body  Surface:  Since  the 

publication  of  the  data  relating  to  the  action  of  high  fre- 
quency sound  waves  on  small  furred  animals  11,12,13/  there  has 
been  much  speculation  about  the  action  of  Intense  high 
frequency  sound  on  man.  Brings,  Allen  and  Rudnlck,  12/  exposed 
mice  to  sound  waves  from  a siren  at  levels  of  approximately 
l60  db.  They  pointed  out  that  the  animals  were  killed  by  over- 
heating and  that  the  heating  process  proceeded  much  more 
rapidly  In  furred  animals  than  In  those  from  which  the  fur  had 
been  removed.  At  the  Aero  Medical  -Laboratory  these  studies 
have  been  confirmed  14/  using  rats  as  the  experimental  animal . 

At  the  same  time  It  was  observed  that  men  exposed  to  the  same 
sound  fields  were  not  Injured  but  occasionally  experienced 
heating  between  the  fingers  as  discussed  earlier  under  heat 
receptors . Investigation  of  the  absorption  coefficients  of 
the  body  surfaces  of  animals  and  man  14, 15/  provided  an  adequate 
explanation  of  the  difference  In  the  action  of  the  sound  energy. 
I shall  simply  summarize  the  present  status,  here,  by  saying 
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that  there  Is  no  evidence  that  air-borne  sound  waves  at  ulta- 
sonlc  frequencies  constitute  a hazard  to  man.  There  Is  definite 
evidence  that  ultrasonic  frequencies  at  the  energy  levels 
generated  by  current  aircraft  power  plants  do  not  constitute 
a hazard  to  man.  On  the  other  hand  the  physical  properties 
of  the  human  body  are  such  and  the  energy  level  of  the  sound 
waves,  at  frequencies  below  200  cps.  Is  such  that  there  Is  a 
probability  of  Injury  from  the  low  frequencies . The  major 
hazard  to  man  of  sound  in  the  frequency  range  200  cps  to 
20,000  cps  Is  the  action  on  the  human  ear  and  possibly  on  other 
specialized  receptor  systems  when  the  sound  waves  are  extremely 
Intense . ” 7s 16/ 

Elsewhere,  Parrack  17/  has  given  these  further  details 
on  vibratory  phenomena: 

"Sound  levels  of  130  db  and  above  stimulate  receptors  In 
the  mouth,  nasal  passages  and  external  ear  canal . The  end 
organs  stimulated  are  probably  the  skin  pressure  receptors. 

"When  the  sound  level  at  certain  frequencies  Is  about 
l40  db  we  perceive  a strong  sensation  of  vibration  of  the  skull . 
This  sensation  Is  most  prominent  when  the  frequency  is  between 
750  and  2000  cps.  The  frequency  arousing  maximum  sensation  at 
a given  sound  level  may  be  different  for  different  Individuals. 
These  sensations  are  particularly  annoying  and  few  persons 
willingly  remain  In  such  a sound  field. 

"in  the  presence  of  sounds  at  levels  above  l40  db, 
especially  when  the  frequency  Is  below  1000  cps,  the  chest 
wall,  the  abdominal  wall  and  even  the  arm  and  leg  muscles  are 
set  In  vibration.  These  vibratory  sensations  become  more 
prominent  as  the  frequency  decreases  at  least  down  to  100  cps . 
Such  sensations  are  not  only  annoying  but  at  least  In  some 
individuals  nausea  and  vomiting  are  Induced.  There  Is  also 
evidence  that  the  vibrations  of  the  body  muscles  produce  re- 
flex Inhibitions  which  make  precise  muscular  control  difficult. 
These  reactions  markedly  reduce  man's  performance  efficiency." 

In  a Memorandum  Report  In  May,  1948,  Parrack  and  his 
collaborators  18/  state  also: 

"There  have  been  occasions,  during  exposure  to  these 
low  frequency  sound  fields,  both  In  the  laboratory  and  at  the 
Jet  engine  test  cells,  when  personnel  have  observed  a "weak- 
ness In  the  knees"  or  an  apparent  general  weakening  of  the 
body  supporting  musculature.  This  sensation  Is  not  accompanied 
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by  faintness  or  vertigo  and  is  probably  not  the  result  of  a 
true  muscular  weakness.  It  would  appear  to  result  from  an 
effect  on  the  proprioceptive  reflex  mechanism  since  with  con- 
scious effort  one  can  maintain  the  normal  erect  position 
usually  maintained  by  reflex  mechanisms . ” 

Finally  Parrack  and  others  report  blurred  vision  in  the 
presence  of  Intense  vibration  of  the  cranial  bones  or  of  the 
eyeballs.  Recovery  after  moderate  exposure  is  apparently 
immediate  and  complete  as  soon  as  the  subject  leaves  the  sound 
field. 


More  recently,  Dickson  and  Chadwick  have  reported  their 
"Observations  on  Disturbances  of  Equilibrium  and  Other  Symptoms 
Induced  by  Jet-Engine  Noise."  19/  They  refer  to  mild  and 
transitory  symptoms  of  momentary  dizziness,  unsteadiness  and 
even  "mental  aberration"  in  personnel  working  close  to  turbo- 
jet engines.  They  suggest  that  the  observed  phenomena  are  a 
manifestation  of  the  "Tulllo  reaction,"  l.e.,  the  production 
of  head  motion  by  means  of  loud  sounds.  (See,  for  example. 

Pig.  48  of  ref.  20/) . Dickson  and  Chadwick  feel  that  the 
above-mentioned  symptoms  may  be  due  to  intense  acoustic  stimu- 
lation of  the  vestibule  or  even  to  a leakage  phenomenon 
between  the  auditory  and  vestibular  branches  of  the  eighth 
nerve . The  British  authors  state  that  the  symptoms  appear  to 
be  more  marked  if  engine  speeds  are  changed  than  when  an  engine 
is  run  at  constant  speed. 

Besides  the  effects  we  have  designated  as  Intersensory, 
there  are  numerous  studies  dealing  with  the  effects  of  noise 
upon  various  physiological  functions . Kryter  \/  has  made  a 
careful  review  of  the  literature  in  this  area.  He  finds  that 
for  steady  noise  of  SPL  below  120  db,  human  organisms  seem  to 
adapt  in  a satisfactory  manner,  provided  they  are  given  enough 
time  to  do  so.  One  of  the  latest  studies  that  confirm  these 
conclusions  is  that  by  Pinkie  andPoppen  2l/.  We  quote  the 
following  paragraphs  from  the  summary  of  their  paper: 

"Experimental  data  derived  from  exposing  9 volunteer 
Navy  enlisted  men  and  a medical  officer  at  various  positions 
near  a General  Electric  I-l6  turbo-jet  engine  for  a total  of 
20  hours  over  a period  of  6 weeks  revealed  the  following: 
a)  Increase  in  fatigue  and  Irritability  during  the  entire 
course  of  the  experiment  in  7 of  the  10  subjects.  The  others 
noted  no  change,  b)  Early,  temporary,  sharp  decrease  in 
auditory  acuity  in  the  'conversational  frequency  range'  of  512 
to  4096  cps.  Normal  hearing  was  gradually  regained  by  the  7 
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subjects  affected  within  12  hours  or  less  after  onset,  c)  Loss 
in  weight  of  5 1/2  to  19  pounds  in  5 of  9 subjects.  It  is  ques- 
tionable that  this  finding  is  a result  of  the  experiment, 
d)  Rise  in  fasting  blood  sugar  during  a one-hour  period  of 
exposure  to  the  Jet  engine  in  all  subjects  and  decrease  in  the 
fasting  blood  sugar  level  after  a two-hour  exposure  in  7 of  the 
10  test  subjects,  e)  Normal  findings,  unchanged  from  pre- 
experlmental  values,  were  noted  for  blood  counts,  urinalyses, 
kidney  function  tests,  bleeding  and  clotting  times,  icteric 
indices,  upper  gastrointestinal  x-ray  studies,  electro- 
cardiograms and  electroencephalograms  during  and  after  the  ex- 
periment . 

"Noise  analyses  of  this  engine  revealed  overall  intensi- 
ties of  120  db  to  7500  cps,  and  peaks  of  sound  were  noted  up 
to  38,000  cps." 

There  are,  of  course,  practical  situations  for  which  the 
Pinkie -Poppen  experiments  would  not  constitute  sufficiently 
realistic  tests.  We  can  think  of  Instances  in  which  an  indi- 
vidual would  not  have  time  to  adapt  to  noise,  but  in  which 
the  In^jortant  instant  is  at  the  onset  of  the  noise . An  under- 
standing of  such  noise -emergency  situations  is  clearly  not 
possible  at  present. 

Hale  21/  recently  reported  that  Increased  adreno-cortlcal 
activity  was  associated  with  intense  engine  noise . It  is  cer- 
tainly not  unreasonable  to  assume  that  there  are  levels  of 
acoustic  noise  that  will  overtax  the  adaptive  and  compensatory 
mechanisms  of  man. 

Some  individuals  may  have  to  carry  out  taxing  tasks  in 
surroundings  where  noise  levels  are  of  the  order  of  150  to  l60 
db.  The  first  necessity  is  to  provide  all  the  ear  protection 
that  is  available.  But  we  may  have  to  go  further.  We  may 
have  to  set  up  tests  that  will  select  men  who,  on  the  basis  of 
their  physiological  responses,  their  motor  skills  and  other 
aptitudes,  are  best  qualified  to  live  and  operate  in  such 
sound  fields. 
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17.3  Noise  and  Annoyance;  Noise  and  Performance 


Prom  time  Immemorial  men  have  complained  about  noise . 
Poets  and  housewives  have  been  equally  eloquent  In  their 
tirades,  declaring  that  noise  "drives  them  crazy".  While  we 
have  no  reason  to  doubt  the  sincerity  of  such  complaints, 
they  do  not  have  the  status  of  scientific  evidence. 

People  complain  about  many  noises  because  they  "annoy" 
them,  or  because  they  think  the  noise  Interferes  with  the 
performance  of  their  work.  These  complaints  occur  though  there 
are  usually  no  direct  effects  on  body  functions  other  than 
hearing  (the  subject  of  Sec.  17.2),  and  there  Is  no  question 
of  permanent  or  temporary  effects  on  hearing  because  of  ex- 
posure to  high  Intensities  (See.  17.4).  We  shall  be  concerned 
In  this  section  with  the  general  question  of  the  relation  of 
noise  (not  necessarily  high  Intensity  noise)  to  annoyance  and 
to  the  performance  of  tasks . 

Noise,  a Source  of  Annoyance.  Annoyance  Is  not  a con- 
cept that  we  can  easily  quantify.  It  Is  complex,  as  some 
trivial  Illustrations  will  Indicate.  For  example,  most  people 
In  noisy  cities  have  learned  to  sleep  In  this  environment, 
although  some  of  them  may  have  trouble  falling  asleep  In  the 
country,  with  Its  relatively  low-level  noise  of  rustling  leaves, 
frogs  and  birds . Apparently,  man  can  become  adapted  to  a large 
range  of  physical  environments,  although  all  people  do  not 
adapt  equally  well.  We  see  that  the  Important  physical  quan- 
tity Is  not  simply  overall  sound  pressure  level,  or  even  the 
octave  band  pressure  levels.  Except  at  extremely  high  levels, 
people's  reaction  bo  noise  depends  upon  many  factorsj  only  a 
few  of  which  are  the  aooustlc  characteristics  of  the  noise . 

For  a given  noise,  we  can  certainly  state  that  annoyance 
will  Increase  with  Intensity  (and  hence  with  loudness),  but  we 
are  unable  to  measure  a general  threshold  of  annoyance . Such 
a Judgment  would  depend  upon  the  task  that  the  noise  Interferes 
with,  the  kinds  of  noises  the  subject  has  been  exposed  to  In  the 
past,  etc.  We  could,  for  example,  rank  various  sources  of 
sound  In  the  order  In  which  they  would  annoy  us  If  we  were 
listening  to  a concert.  The  same  rank  order  would  not  neces- 
sarily apply  If  the  noise  lntei:fered  with  our  sleep. 

It  Is  apparent,  then,  that  annoyance  responses  to  noise 
are  truly  multl-dlraenslonal . They  are  compounded  of  reactions 
to  noisy  events  that  Irritate  you,  that  startle  you,  that  pre- 
vent sleep,  that  prevent  you  from  calling  to  your  child,  that 


WADC  TR  52-204 


147 


make  it  impossible  for  you  to  hear  the  radio,  that  strike 
you  as  most  unnecessary,  etc . 

Recently,  the  French  acoustician  Moles  23/  has  tried  to 
classify  noises  into  three  categories:  (l)  those  that  are 

disagreeable  or  "get  on  your  nerves";  their  level  is  less  than 
40  db,  and  they  are  effectively  suppressed  (according  to  Moles) 
by  masking;  (2)  the  noises  that  bother,  disturb  or  annoy  you, 
whose  levels  are  between  50  and  80  db;  their  symptoms  are 
interference  with  speech  communication,  nervous  fatluge  and 
Interference  with  abstract  thinking;  and  (3)  the  destructive 
noises,  that  lie  above  100  db,  and  that  attack  directly  the 
biological  organism.  If  some  of  the  above  terms  were  defined 
by  operations  of  measurement,  and  if  certain  further  specifi- 
cations of  the  noise  were  provided  beside  the  overall  level, 
a classification  such  as  Moles'  cam,  perhaps,  provide 
hypotheses  for  experimental  investigations.  For  the  present, 
however,  the  classification  is  merely  a verbal  scheme  with 
little  power  of  prediction. 

Let  us  now  turn  to  laboratory  investigations  of  the 
concept  of  annoyance . It  stands  to  reason  that  we  should  pro- 
ceed with  caution  in  extrapolating  from  laboratory  findings  in 
this  area  to  the  everyday  behavior  of  our  fellow  citizens . 
Kryter  \J  reviews  most  of  the  experimentation  and  comes  up  with 
certain  generalizations  on  various  aspects  of  noise  such  as 
its  unexpectedness.  Inappropriateness  and  Intermittency . The 
acoustical  characteristics  that  he  considers  are  the  intensity 
and  the  frequency  pattern  of  the  noise . 

Reese  and  Kryter  24/  conducted  an  experiment  in  which 
their  subjects  were  asked  to  equate  bands  of  noise  for  loud- 
ness and  for  annoyance.  For  the  latter  task  the  five  subjects 
adjusted  the  Intensity  of  bands  of  noise  (250  mels  wide)  until 
they  felt  that  they  were  each  equally  tolerable  when  compared 
to  a standard  band.  The  noises  were  presented  by  means  of 
earphones . The  data  of  Reese  and  Kryter  (with  their  consider- 
able inter-subject  variability)  show  that  at  the  higher 
frequencies  above  3000  cps  equal  annoyance  can  be  achieved 
with  less  Intensity  than  equal  loudness.  For  speculations  on 
the  physiological  basis  of  this  phenomenon,  see  Husson  25/. 

Pollack  26/  in  a later  experiment  required  three  subjects 
to  discriminate  various  psychological  attributes  of  bands  of 
noise.  Annoyance  (defined  as  "the  disturbingness  of  bother- 
someness of  the  sound)  turned  out  to  be  one  of  the  attributes 
that  was  relatively  clearly  differentiated  from  loudness.  At 


WADC  TR  52-204 


148 


both  ends  of  the  frequency  spectrum  Pollack's  observers 
assured  him  that  bands  of  noise  less  loud  than  the  standard 
band  (either  60-5000  cps  or  129O-I9OO  cps  were  equal  to  it 
in  annoyance . 

G.  A.  Miller  in  an  investigation  on  the  masking  of 
speech  27 A was  also  concerned  with  the  problem  of  annoyance. 
Since  his  approach  coincides  with  much  that  we  have  already 
said,  we  shall  quote  a rather  extensive  passage  from  his  paper: 

"When  we  take  this  problem  [of  annoyance]  into  the 
laboratory,  it  seems  to  disappear  right  in  front  of  our  ears . 

The  major  difficulty  rests  in  the  fact  that  the  listener's 
attitude  is  so  Important . If  he  is  engaged  in  difficult  mental 
work,  it  may  be  relatively  easy  to  annoy  him.  But  if  he  listens 
with  a defiant  attitude,  any  attempts  to  upset  him  with  strange 
noises  may  prove  more  amusing  than  effective . And  since  most 
of  the  sounds  ,we  can  use  in  the  laboratory  are  out  of  context 
and  relatively  meaningless,  the  task  of  being  successfully 
obnoxious  is  practically  impossible . Annoyance  depends 
primarily  upon  the  particular  listener  and  the  particular 
situation  in  which  he  finds  himself. 

"If,  however,  we  are  content  to  Ignore  some  of  the 
situational  variables  Involved,  it  is  possible  to  ask  listeners 
to  compare  different  sounds  on  the  basis  of  their  "annoyance 
value . " Some  simple  listening  situation  is  standardized  and 
the  listeners  compare  pairs,  use  a rating  scale,  or  rank-order 
an  array  of  sounds . One  can  then  evaluate  the  variables 
contributing  to  annoyance  value  as  defined  by  the  situation, 
although  the  safety  with  which  the  results-  can  be  extended  to 
other  situations  is  open  to  question. 

"Listeners  were  presented  with  pairs  of  sounds  and  were 
Instructed  to  indicate  which  of  the  two  sounds  was  more  annoy- 
ing. In  making  this  decision,  the  listeners  were  told  to  Judge 
which  sound  of  the  pair  would  be  more  unendurable  if  they  had 
to  listen  to  it  for  a long  period  of  time.  These  instructions, 
therefore,  constitute  the  definition  of  annoyance.  Fairly 
consistent  results  were  obtained  with  groups  of  10  to  20 
listeners,  and  the  scale  of  annoyanae  constructed  in  this  way 
agreed  closely  with  results  obtained  with  rating  or  rank- 
ordering  procedures . 

"As  an  illustration,  the  results  obtained  with  stepped 
patterns  of  tones  will  be  considered.  Eight  different  variables 
in  the  tonal  pattern  were  studied  for  their  effect  upon 
annoyance -value . 
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"l.  The  higher  the  pitch  of  the  component  tones,  the 
greater  the  annoyance -value . The  range  of  frequencies  tested 
was  from  200  to  150O  cylces. 

"2.  A wide  range  of  frequencies  between  the  highest  and 
lowest  steps  Is  more  annoying  than  a restricted  range.  Listeners 
reported  that  the  wide  range  of  component  frequencies  tended  to 
be  perceived  alternately,  first  as  a complete  pattern  and  then 
as  two  patterns,  one  of  high  and  one  or  low  pitch.  This  effect 
Is  very  similar  to  figure-ground  reversals  in  visual  perception. 

”3.  The  addition  of  continuous  tones  to  the  stepped 
pattern  of  tones  produces  complex  effects  dependent  upon  the 
frequency-relation  between  the  tones.  Beats  give  the  sound  a 
rough  pulsing  Irregularity  which  the  listeners  disliked. 

”4.  Listeners  asked  to  compare  continuous  sounds  of 
different  wave-shapes  found  the  complex  sounds,  especially 
brief  pulses,  more  annoying.  In  general,  the  sine  wave  was 
found  to  produce  little  annoyance . 

"5.  Patterns  of  3^  4,  6 and  12  tones  were  compared,  but 
the  number  of  different  steps  In  the  complete  patte'rn  had  little 
effect  on  the  Judgments  of  annoyance. 

"6.  If  one  of  the  steps  of  a pattern  Is  slightly  longer 
In  duration  than  the  others,  a rhythmic  quality  Is  added  which 
the  listeners  Judged  to  be  more  annoying  than  tones  of  equal 
duration.  Even  more  annoying,  however.  Is  the  pattern  In 
which  all  the  tonal  durations  are  randomly  varying. 

"7.  A slow  rate  of  repetition  for  a pattern  of  tones  Is 
considered  slightly  more  annoying  than  a rapid  rate. 

"8.  Up  to  a certain  limit,  the  annoyance -value  Is  In- 
creased If  silent  Intervals  are  Introduced  between  the  succes- 
sive steps. 

"These  results  typify  listeners ' responses  to  meaningless 
sounds.  When  meaningful  sounds  like  speech  or  music  were  used, 
the  listeners  refused  to  apply  the  word  "annoyance"  In  describ- 
ing them.  Annoyance  did  not  seem  to  be  a proper  dimension  of 
such  sounds,  but  the  listeners  were  agreeable  to  calling  the 
sounds  "distracting."  Apparently,  meaningful  sounds  have  a 
higher  "attention-value"  than  meaningless  sounds. 
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"These  experimental  results,  supplemented  by  results 
obtained  with  other  types  of  sound,  indicate  that  annoyance 
is  related  to  three  aspects  of  the  sound. 

"Loudness . The  most  important  single  factor  in  determin- 
ing annoyance -Judgments  is  the  intensity  of  the  sound.  With 
sufficient  intensity,  any  sound  can  be  made  annoying,  and 
extremely  loud  sounds  produce  actual  pain.  Since  this  variable 
is  so  fundamental  to  annoyance,  care  was  taken  to  equate  the 
intensity  of  the  signals  when  other  aspects  were  being  studied. 

"Pitch.  In  general,  sounds  having  their  energy  concen- 
trated among  the  higher  audible  frequencies  are  more  annoying 
than  low-frequency  noises.  In  this  respect,  the  frequency  of 
the  sound  alters  its  annoyance -value  in  a manner  opposed  to 
the  effect  on  masking.  With  a low-frequency  noise  we  cannot 
hear  speech,  but  with  a high-frequency  noise  we  are  more  apt 
to  be  annoyed. 

"Modulation  of  Loudness  and  Pitch.  A third  important 
factor  is  the  modulation  which  the  sound  undergoes.  Listeners 
report  that  they  prefer  to  listen  to  continuous,  unchanging 
sounds,  and  that  a sound  changing  irregularly  from  moment  to 
moment  is  more  annoying  than  a sound  which  is  changing 
regularly.  Listeners  feel  that  the  distraction  of  a changing 
sound  is  less  desirable  than  the  boredom  of  a constant  sound, 
and  they  retain  this  opinion  even  after  many  hours  of  articula- 
tion testing  in  the  presence  of  different  noises.  Apparently 
the  changes  in  loudness  are  more  effective  than  changes  in 
pitch,  but  the  individual  differences  on  this  point  are  too 
conspicuous  to  permit  a safe  generalization. 

"judgments  of  the  pleasantness,  indifference,  and 
unpleasantness  would  probably  have  led  to  very  similar  con- 
clusions. Thus,  while  the  results  may  be  interesting  as  an 
exercise  in  experimental  esthetics,  the  character  of  the 
problem  has  somehow  been  altered  by  the  experimental  approach. 
The  principal  concern,  it  will  be  recalled,  is  with  annoyance 
as  a hazard  to  vocal  communication.  On  this  score  the  results 
are  consistently  negative,  and  at  no  point  in  the  experimental 
results  is  there  unequivocal  evidence  that  the  articulation 
scores  obtained  by  trained  listeners  in  the  presence  of  an 
annoying  sound  were  lower  than  the  scores  obtained  in  the  pre- 
sence of  an  indifferent  sound  which  had  the  same  acoustic 
spectrum.  With  the  attitude  adopted  by  listeners  in  the 
laboratory  situation,  annoyance  is  not  a hazard  to  communi-catlon. 
And  yet,  sounds  do  differ  in  annoyance  value,  and  annoyance  or 


WADC  TR  52-204 


151 


distraction  does  sometimes  Interrupt  our  verbal  flow.  Perhaps 
the  most  reasonable  generalization,  therefore.  Is  that  when  a 
listener  find  himself  in  a situation  where  he  is  vulnerable  to 
auditory  annoyance,  he  is  most  vulnerable  to  loud,  high-pitched, 
unpredictable  sounds . Just  what  situational  and  attltudlnal 
factors  contribute  to  his  vulnerability, however,  this  research 
does  not  reveal." 

It  Is  safe  to  predict  that  people  will  continue  to 
Investigate  the  problem  of  annoyance  by  noise.  Let  us  hope 
that  they  will  be  able  to  find  at  least  a partial  answer  to 
the  problem  by  adopting  a reasonably  operational  definition  of 
the  annoyance  concept  In  their  experiment  or  opinion  poll. 

Noise,  an  Enemy  of  Performance?  Noise  has  often  been 
blamed  for  poor  performance,  for  lack  of  efficiency,  labor 
turnover  and  absenteeism.  Many  industrial  field  studies 
actually  make  such  claims.  Kryter's  1/  analysis  has  shown  the 
extent  to  which  most  of  these  studies  neglected  to  control 
variables  that  could  have  been. responsible  for  the  effects 
that  were  observed.  Among  the  most  Important  factors  besides 
environmental  variables  such  as  heating,  lighting,  ventilation, 
etc.  that  may  have  been  overlooked  are  (a)  the  role  of  speech 
communication  in  the  working  process,  and  (b)  the  fact  that  a 
field  investigation  that  shows  concern  of  the  management  with 
labor  conditions  may  result  In  Improved  performance  on  the 
basis  of  motivational  factors  alone . 

A physician  who  wants  to  test  the  effectiveness  of  a 
new  drug  divides  his  population  Into  two  groups . He  may  decide 
to  administer  his  drug  to  one -half  of  the  population,  while 
giving  sugar  pills  to  the  other  half.  Such  a procedure  pre- 
sumably equalizes  the  psychological  situation  of  both  groups, 
since  they  have  both  been  given  equal  consideration.  Likewise, 
one  might  want  to  use  In  an  experiment  good  ear  protectors  for 
one  group  and  Ineffective  ones  for  the  other  In  order  to  dupli- 
cate this  medical  technique . 

Recent  laboratory  studies  demonstrate  little.  If  any, 
lasting  effect  of  noise  upon  performance . This  sharp  contrast 
between  field  studies  and  laboratory  studies  Is  not  too  sur- 
prising If  we  remind  ourselves  that  the  listener's  response 
Is  not  Just  a function  of  the  acoustic  stimulus,  but  depends 
upon  other  factors . 

As  an  example,  we  should  like  to  refer  to  a routine 
performance  under  noise  by  Mech  28/.  The  author  gave  four 
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groups  of  fifteen  subjects  verbal  addition  tasks  for  thirty 
minutes  each  day:  fifteen  minutes  In  the  "quiet",  and  fifteen 

minutes  In  an  environment  of  "verbal  noise"  (a  record  from 
the  series  "l  Can  Hear  It  Now"  was  played  at  an  average  level 
of  70  db  SPL  at  the  subjects'  position).  All  subjects  (under- 
graduates) were  told  to  add  as  rapidly  and  as  accurately  as 
possible;  they  were  told  that  they  would  be  given  credit  for 
the  total  number  of  correct  responses.  The  various  groups 
were  Instructed  as  follows:  Group  A was  told  that  the  experi- 

ment concerned  the  effects  of  noise  on  work.  Group  B was  told 
the  same  story  and  also  shown  (faked)  graphs  Indicating  that 
In  previous  experiments  people  had  performed  better  In  the 
presence  of  noise.  Group  C was  told  the  same  story  as  A, 
except  that  they  were  shown  (faked)  graphs  Indicating  that 
people  had  previously  performed  better  In  the  quiet . Finally, 
Group  D was  treated  like  Group  C,  except  that  their  (faked) 
graphs  showed  that  while  In  a previous  experiment  people  per- 
formed at  first  better  In  the  quiet,  they  finally  performed 
better  In  noisy  conditions.  The  results  were  striking.  There 
was  no  significant  difference  for  Group  A between  performance 
In  noisy  or  quiet  conditions.  Groups  B,  C and  D behaved  (at 
a statistically  significant  level)  Just  as  the  (faked)  graphs 
had  "instructed"  them  to.  We  have  no  opportunity  here  to  go 
Into  details  of  the  results,  but  we  feel  that  all  future  In- 
vestigations of  the  effect  of  noise  on  performance  should 
study  them  carefully. 

Perhaps  one  of  the  most  comprehensive  studies  of  the 
effects  of  noise  on  psychomotor  efficiency  was  conducted  at 
the  Psycho-Acoustics  Laboratory  of  Harvard  University  during 
World  War  II.  (See  references  29,  30  for  a detailed  descrip- 
tion of  the  experiments).  Five  carefully  selected  subjects 
participated  In  the  tests  during  a period  of  two  months.  The 
various  tests  were  administered  under  two  conditions:  (a)  air- 
plane noise  whose  overall  level  was  115  db  and  (b)  airplane 
noise  whose  overall  level  was  90  db  (an  Intensity  Just  suffi- 
cient to  mask  casual  laboratory  noises  and  to  discourage 
conversation  among  the  subjects. 

"The  battery  of  psychomotor,  physiological  and  psycho- 
logical tests  Included  measurements  of  the  near  point  of  vision, 
speed  of  accommodation,  speed  of  eye  movement,  visual  acuity, 
muscular  tension,  heart  rate,  finger  tremor,  blood  pressure, 
marksmanship,  card  sorting,  span  of  apprehension,  paper  form 
board  tests,  steadiness  tests,  tapping  board  and  standard 
pursuit  rotors  for  testing  motor  coordination,  coordinated 
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serial  pursuit  meters  which  partially  simulated  the  require- 
ments of  Instrument  flying,  tests  of  coordinated  serial  re- 
action time,  audlometrlc  measurements  of  hearing  loss,  vibra- 
tion tests,  tests  of  serial  disjunctive  reaction  time,  coding 
tests,  judgments  of  distance,  etc." 

Noise  produced  a definite  effect  on  hearing  by  causing 
temporary  threshold  shifts.  However,  the  remainder  of  the 
tests  showed  either  indeterminate  results  (large  inter-  or 
intra-observer  variability)  or  showed  that  the  subjects  per- 
formed as  well  in  the  "noise"  (115  db)  as  in  the  "quiet"  (90 
db) . 


In  conclusion,  the  authors  of  the  study  state  that  "air- 
plane noise  has  at  worst  only  a slightly  detrimental  effect 
upon  functions  involving  motor  coordination,  reaction-time, 
sensory  perceptions,  and  certain  mental  functions.  In  most 
Instances  it  can  be  positively  demonstrated  that  noise  has  no 
effect,  even  after  exposures  lasting  seven  hours.  Functions 
such  as  breathing,  metabolism,  and  muscle  tension  are  some- 
times affected  by  noise,  but  what  the  effect  will  be  seems  to 
depend  upon  the  kind  of  individual  tested,  and  simple 
generalizations  are  not  possible. 

"Nevertheless,  all  subjects  seem  to  prefer  not  to  work 
in  an  Intense  noise,  and  after  a day  under  acoustic  stress, 
they  tend  to  report  a subjective  feeling  of  being  more  tired 
and  irritable . They  ha-ve  a ringing  in  the  ears  and  a temporary 
hearing  loss  to  testify  to  the  severity  of  the  conditions  under 
which  they  and  the  pilots  of  aircraft  labor. 

"The  most  severe  effects  of  noise  are  upon  the  ear  Itself . 
Speech  communications  are  Impaired  and  temporary  losses  in 
hearing  are  produced" . 

This  study  has  been  much  quoted  and  also  at  times 
criticized.  It  would  seem  unwise  to  extrapolate  too  far  from 
the  performance  of  five  carefully  selected  laboratory  subjects 
under  two  conditions  of  noise  to  what  might  happen  to  a group 
of  unselected  subjects  in  30  db  more  noise  in  a real  life 
situation.  Studies  such  as  these,  and  those  conducted  by 
Hanley  and  Williamson  3]^  and  Smith  32/  raise  Important  methodo- 
logical problems.  Can  we  expect  to  bring  problems  like  noise- 
induced  annoyance,  or  noise-induced  decrement  in  performance 
into  the  laboratory  without  disembodying  them  at  the  same  time? 
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A promising  step  in  that  direction  seems  to  have  been 
made  by  Broadbent  33/  In  England.  He  demonstrated.  In  the 
laboratory,  deterioration  In  the  performance  of  certain  tasks 
In  the  presence  of  Intense  noise.  His  ten  subjects  seemed 
unaffected  by  the  presence  of  noise  If  their  task  was  an  easy 
one.  When  given  a more  difficult  "vigilance-task,”  their 
performance  on  "noise  days"  was  worse  than  on  the  quiet  days 
that  came  before  or  after.  This  Is  perhaps  one  of  the  ways 
of  bringing  field  and  laboratory  data  together:  Make  the 

laboratory  situations  more  like  real  life  and  let  us  bring 
the  field  situations  under  better  control  for  variables  other 
than  noise . 

17.4  The  After-Effects  of  Exposure  to  Sound  Upon  Man's  Hearing 


A study  of  these  effects  Is  obviously  of  crucial  Interest 
If  we  wish  to  understand  the  effects  of  noise  upon  human  be- 
havior. We  have  seen  that  sounds  may  affect  man  in  spheres 
other  than  the  auditory  one.  It  Is  In  the  realm  of  hearing, 
however,  that  the  most  striking  and  perhaps  also  best  under- 
stood effects  take  place . 

In  earlier  sections  we  have  dealt  with  the  Interference 
of  simultaneous  noise  with  such  auditory  functions  as  speech 
communication.  We  have  seen  that  masking  can  be  described  as 
a temporary  change  In  man's  sensitivity  to  acoustic  stimuli. 

We  have  shown  that  masking  involves  not  only  a shift  In  thres- 
hold but  also  modification  of  such  psychological  functions  as 
loudness,  pitch,  localization,  etc.  When  we  examined  the  sub- 
ject of  masking  our  attention  was  focused  upon  the  Interference 
effects  of  the  noise  while  It  was  on.  We  did  not  ask  ourselves 
at  that  time  what  happens  after  the  unwanted  sound  ceases 
impinging  upon  our  eardrums . What  happens  In  the  wake  of  any 
acoustic  stimulus? 

The  answer  to  this  question  Is,  as  usual,  not  a simple 
one . It  turns  out  that  we  need  to  specify  a considerable  num- 
ber of  parameters  before  we  can  expect  to  give  a set  of 
adequate  answers.  There  Is,  however,  a physiological  mechanism 
that  we  should  examine  in  some  detail  before  we  approach  the 
specific  topic  under  discussion. 

Organisms  are  exposed  to  all  sorts  of  stimuli,  l.e.,  to 
all  kinds  of  Interactions  with  their  environment.  These  inter- 
actions Involve,  In  general,  an  energy  transfer  from  the 
environment  to  the  organism.  Much  of  the  time,  organisms  must 
"defend"  themselves  against  this  continuous  bombardment  In  one 
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way  or  another.  One  of  the  most  effective  defenses  is  the 
mechanism  called  adaptation. 

We  can  hardly  enter  here  Into  a technical  discussion 
of  the  various  kinds  of  adaptation  that  exist  and  of  the  chem- 
ical changes  that  seem  to  underlie  the  phenomenon  of  adapta- 
tion. Suffice  it  to  say  that  in  practically  all  instances 
an  organism's  response  to  a continued  stimulus  decreases  after 
the  initial  response  burst . 

Adaptation  exists  apparently  at  all  levels  of  organiza- 
tion of  living  organisms . There  is  adaptation  at  the  level 
of  a single  nerve  fiber,  there  are  adaptation  phenomena  in 
whole  populations  of  nerve  fibers,  and,  finally,  there  are 
phenomena  designated  as  adaptation  that  Involve  the  behavior 
of  the  total  organism.  This  semantic  confusion  should  not 
lead  us  to  assume  that  these  different  processes  have  neces- 
sarily the  same  course  or  are  otherwise  quantitatively  similar. 

Adaptation  exists  in  all  of  man's  sense  departments 
though  apparently  to  different  extents . We  are  all  familiar 
with  the  fact  that  people  adapt  considerably  faster  to  smells 
and  skin  pressures  than  to  optical  and  acoustical  stimuli . 

This  state  of  affairs  has  important  consequences ; since  the 
senses  of  vision  and  audition  relay  most  of  the  information 
that  man  receives  about  the  world  in  which  he  lives . 

You  will  remember  that  in  a masking  situation  man's 
sensitivity  to  acoustic  stimuli  is  reduced  while  the  masker 
is  present.  This  has  been  known  as  the  "line-busy"  effect. 

You  will  also  remember  that  this  reduced  responsiveness  is 
not  identical  for  various  stimuli.  It  is  clear,  however,  that 
a cessation  of  the  masker  will  not  bring  about  an  immediate 
recovery  of  the  organism's  normal  condition.  In  contrast  to 
vacuum  tube  circuits,  biological  entitles  do  not  respond 
instantaneously,  in  the  sense  of  microseconds.  You  may  have 
heard  of  something  that  sounded  to  you  like  an  organism's 
time  constants;  you  may  be  familiar  with  the  terms  absolute 
refractory  period  and  relative  refractory  period.  You  may 
even  remember  that  figures  of  the  order  of  a millisecond  are 
representative  of  the  absolute  refractory  period  of  a nerve, 
i.e.,  of  the  period  during  which  no  stimulus,  regardless  of 
its  strength,  is  presumed  to  be  able  to  give  rise  to  a response. 

You  probably  know  from  your  own  experience  that  it  takes 
much  longer  than  a millisecond  to  recover  one's  normal  sensi- 
tivity after  an  intense  flash  of  light  or  after  a strong  burst 
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of  sound,  provided  the  conditions  of  exposure  permit  recovery I 
While  there  are  figures  in  the  literature  for  relative  refrac- 
tory periods,  these  figures  explain  at  best  only  a very 
limited  category  of  after-effects  of  exposure . We  have  to 
accept  the  uncomfortable  fact  that  after-effects  of  exposure 
come  In  all  sizes,  that  some  of  the  after-effects  are  reversi- 
ble while  others  are  not,  and,  finally,  that  even  all  the  re- 
versible effects  are  not  necessarily  explainable  In  terms  of  a 
single  or  simple  physiological  mechanism. 

In  view  of  these  observations,  we  should  not  be  surprised 
that  workers  In  audition  have  an  entire  arsenal  of  more  or  less 
confusing  terms  to  describe  the  after-effects  of  stimulation. 

You  will  find  papers  on  acoustic  trauma,  on  noise-induced 
deafness,  on  stimulation,  temporary  or  experimental  deafness, 
on  nolse-lnduced  hearing  loss,  on  auditory  or  post-stimulatory 
fatigue*  (sometimes  also  called  short  time  auditory  fatigue), 
on  residual  or  after-effect  masking,  on  threshold  shifts,  on 
auditory  adaptation  and  so  on.  The  authors  of  these  papers 
are  not  trying  to  be  contrary  or  to  make  things  tough  for  their 
readers  or  themselves . They  are  faced  with  such  a wide  range 
of  observable  phenomena  that  they  feel  It  necessary  to  invent 
a small  dictionary  for  their  description. 

The  Parameters  of  Exposure  and  Recovery.  Prom  the  above 
remarks.  It  should  be  clear  that  we  need  to  specify  in  detail 
the  conditions  under  which  a group  of  individuals  has  been 
exposed  to  acoustic  stimulation,  and  the  conditions  under 
which  their  hearing  will  be  tested  before  we  can  make  statis- 
tically valid  predictions  of  their  response  characteristics 
at  any  given  time  after  exposure . 

How  are  we  going  to  specify  the  exposure  stimuli?  Since 
studies  involving  loss  of  auditory  functions  have  been  con- 
ducted both  In  the  laboratory  and  In  the  field  (l.e.,  in  factories, 
airplanes,  submarines,  on  the  rifle  range,  etc.)  It  is  necessary 
to  specify  carefully  the  way  in  which  the  acoustic  exposure 
stimuli  have  been  measured.  It  is  not  unreasonable  to  expect 
that  laboratory  experiments,  using  earphones,  will  be  carried 
out  In  the  future  under  such  conditions  of  calibration  that 


* Hood  uses  this  term  In  conjunction  with  the  term  ”per- 
stlmulatory  fatigue"  - a type  of  fatigue  that  Is  de- 
tected by  measurements  carried  on  during  application 
of  the  stimulus . 
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sound  pressure  levels  at  the  eardrum  (or  equivalent  data) 
will  be  available . This  was  by  no  means  always  the  fact 
In  past  studies  where  sometimes  research  workers  are  said 
to  have  mistaken  decibels  of  hearing  loss  read  from  dials 
of  an  audiometer  for  sound  pressure  levels.  If  loudspeakers 
are  used  as  laboratory  sources,  the  specifications  of  the 
actual  stimulus  becomes  somewhat  more  difficult  unless 
probe-tubes  Inserted  In  the  ear  (or  ears)  of  the  subjects 
are  used  as  continuous  monitors. 

Care  has  to  be  used  when  so-called  free-fleld  values 
are  specified  without  the  necessary  precautions  having  been 
taken*.  It  should  be  clear  that  such  "free-fleld"  data 
would  have  to  be  corrected  before  they  can  become  really 
compared  with  earphone  data  (see  Pollack  35/)  and  even  then 
there  would  remain  the  fact  that  the  subject's  head  movements 
might  make  It  difficult  to  state  the  exact  conditions  of 
exposure  unless  a running  record  of  SPL's  were  taken.  The 
situation  Is  very  much  the  same  (only  much  less  definite) 
when  one  attempts  to  specify  sound  pressure  levels  In  field 
studies.  Unless  particular  precautions  are  taken  or  unless 
one  Is  In  the  presence  of  rather  unusual  conditions  In  which 
the  sound  field  Is  thoroughly  homogeneous,  one  should  really 
only  be  satisfied  with  probe-tube  data,  l.e.,  sound  pressure 
levels  measured  at  the  eardrum. 

So-called  average  data  for  the  sound  pressure  levels 
of  a factory,  for  Instance,  are  acceptable  when  deviations 
are  stated  and  when  some  Idea  Is  given  of  the  way  In  which 
this  average  was  determined.  However,  this  entire  discussion 
of  the  method  of  specifying  sound  pressure  levels  would  prove 
rather  useless  unless  we  examined  the  response  characteristics 
of  the  ultimate  Indicator  In  our  measuring  device . There  Is 
little  problem  If  our  exposure  stimulus  Is  either  a pure  tone 
or  a reasonably  well  behaved  white  noise.  Our  standard  sound 
level  meter  gives  us  reasonably  good  Indication  as  long  as 
the  exposure  stimuli  do  not  fluctuate  too  wildly  36/.  However, 
as  soon  as  we  deal  with  noises  In  which  there  Is  an  appreciable 
peak  factor  (for  Instance,  the  noise  that  Is  generated  by  a 
drop  hammer,  described  In  Sec.  7.^)  Indicating  meters  are 
too  sluggish  and  thus  In  a certain  sense  "blind" . Under 
these  circumstances  the  best  way  of  measuring  the  exposure 


* For  an  up-to-date  treatment  of  the  way  In  which  such 
measurements  ought  to  be  carried  out  see  Williams 
and  Cox  3^/. 
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stimulus  consists  in  recording  the  sound  pressure  wave  by 
means  of  a carefully  calibrated  microphone -amplifier- 
oscilloscope  unit*. 

We  certainly  need  to  know  more  than  overall  SPL's  If 
we  want  to  relate  man's  Impaired  auditory  functions  to  the 
exposure  characteristics.  Just  what  kind  of  frequency 
resolving  power  our  analyzer  should  have  is  not  entirely 
certain  at  present.  Somewhere  between  levels  per  cps  and 
overall  level  there  should  be  a usable  bandwidth  that  will 
yield  meaningful  correlations  with  loss  of  auditory  functions. 
There  is  no  particular  reason  for  assuming  that  the  critical 
bands  used  In  predicting  auditory  performance  under  conditions 
of  masking  will  be  the  most  useful  concept  In  a computational 
scheme  predicting  recovery  functions . Most  analyzing  equip- 
ment yields,  as  we  have  seen  in  Volume  I,  octave-band  levels, 
and  some  analyzers  go  down  to  half  or  even  third  octaves . 

Let  us  turn  now  to  the  specification  of  the  time 
characteristics  of  the  exposure  stimulus . If  the  amplitude 
distributions  of  the  sound  do  not  fluctuate  too  much,  l.e. 

If  the  flow  of  sound  energy  is  relatively  steady,  specifica- 
tion of  the  time  characteristics  presents  no  problem. 

However,  if  the  flow  of  sound  energy  Is  Irregular,  It  Is 
not  sufficient  to  specify  only  the  energy  the  auditory 
system  absorbs . We  must  also  consider  how  this  energy  Is 
packaged  in  time . At  present  we  are  not  ready  to  assign 
definite  Integrating  characteristics  to  a meter  that 
measures  exposure  to  noise,  since  we  do  not  have  sufficient 
experimental  data  on  the  significance  of  the  time  character- 
istics of  the  noise . We  must  realize  that,  at  present,  a 
single  number  (be  It  sound  pressure  level,  energy,  or  the 
product  of  sound  pressure  level  and  time)  Is  not  necessarily 
going  to  do  the  job  of  specifying  exposure. 

Next,  let  us  examine  the  parameters  that  come  into  play 
when  we  are  trying  to  assess  recovery  from  exposure.  Let  us 


In  field  measurements  of  Impulsive  sounds  or  of  sounds 
of  short  duration,  the  equipment  required  to  perform 
accurate  on-the-spot  measurements  Is  rather  bulky. 
Frequently  It  Is  more  convenient  to  record  the  sounds, 
along  with  suitable  calibrating  signals,  on  a tape 
recorder.  The  recordings  can  then  be  thoroughly 
analyzed  In  the  laboratory  at  a later  time,  and  the 
necessity  of  transporting  the  analyzing  equipment  Is 
thus  avoided. 
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assume  our  subjects  have  been  exposed  to  a stimulus  whose 
characteristics  are  well-defined,  and  that  we  are  Interested 
in  the  recovery  of  their  absolute  threshold  curve.  You  may 
be  Inclined  to  ask:  What  else  could  we  possibly  be  interested 

in?  We  could  be  Interested  in  practically  any  other  measure 
of  auditory  performance,  and  there  are  quite  a few  as  we 
have  already  seen  in  Chapter  l6.  Our  conclusions  might  be 
rather  different  depending  upon  the  measure  of  performance 
against  which  we  calibrate  our  recovery  studies.  A recent 
paper  by  van  Dlshoeck  37/  bears  out  this  contention. 

Let  us  consider  first  the  recovery  of  the  absolute 
threshold  after  exposures  that  do  not  produce  Irreparable 
damage.  One  of  the  first  questions  that  arises:  How  far 

in  time  are  we  to  follow  the  recovery?  Must  we  Insist  upon 
a post -exposure  threshold  that  comes  within  1 or  2 db  of  the 
pre-exposure  threshold?  If  we  like  that  much  accuracy,  we 
cannot  use  most  of  the  standard  audiometers  since  their 
smallest  Intervals  are  5 steps.  Suppose,  now,  a subject 
has  recovered  within  the  limits  that  we  have  set;  does  such 
a recovery  guarantee  that  his  threshold  will  not  get  worse 
again?  Not  necessarily,  if  we  take  into  account  the  findings 
obtained  by  Bronsteln  38/,  and  more  recent  data  from  Hirsh 
and  Ward  39/.  In  other  words,  we  must  make  certain  that  the 
recovery  of  the  threshold,  once  complete,  is  not  going  to  be 
upset  again.  We  could,  of  course,  measure  the  recovery  of 
the  threshold  in  a different  manner.  We  might  take  thresholds 
at  fixed  time  intervals  after  the  end  of  the  exposure.  Such 
a technique  has  to  be  used  for  experiments  in  which  recovery 
occurs  in  a rather  short  time  (several  seconds  or  even 
fractions  of  a second). 

After  exposure  to  a given  sound,  the  recovery  of  the 
subjects  must  be  measured  by  some  kind  of  a test  stimulus . 

So  far,  we  have  not  touched  upon  the  question  of  the  relation 
between  exposure  stimuli  and  test  stimuli . There  are  almost 
unlimited  possibilities . We  can  start  out  by  making  the 
exposure  stimulus  and  the  test  stimulus  the  same . We  could 
choose,  among  pure  tones,  a test  stimulus  that  is  anywhere  in 
the  audible  frequency  domain,  l.e.,  it  could  be  either  higher 
or  lower  than  our  exposure  stimulus . However,  we  do  not 
necessarily  have  to  test  with  a pure  tone;  we  might  choose 
Instead  a band  of  noise,  a click,  a tone  pip  or  even  speech. 
There  is  no  reason  why  we  should  expect  these  different 
procedures  to  yield  identical  results.  As  a matter  of  fact, 
there  are  certain  conditions  in  which  thresholds  for  certain 
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test  stimuli  improve  for  a certain  time  interval  (so-called 
sensitization)  while  the  same  exposure  stimulus  will  at  the 
same  Instant  in  time  produce  a rather  serious  hearing  loss 
for  other  test  stimuli. 

It  becomes  clear  that  no  single  number  can  describe 
recovery  any  more  than  a single  number  can  describe  exposure . 
Both  exposure  and  recovery  are  complex  multi -dimensional 
entities.  Hence  we  find  it  understandable  that  we  cannot 
give  a simple  account  of  what  happens  as  a .consequency  of 
exposure  to  noise . 

Thus  far  we  have  talked  about  the  effects  of  exposure 
upon  the  absolute  threshold  only.  It  should  be  clear  that  we 
could  and  probably  should  pick  other  aspects  of  man's  auditory 
performance  if  we  want  to  be  able  to  describe  realistically 
the  after-effects  of  exposure.  Let  us  enumerate  Just  a few 
of  the  functions  that  have  been  investigated,  though  in  most 
Instances  rather  spottily. 

Some  work  has  been  done  on  the  effects  of  exposure  upon 
loudness*,  upon  pitch  perception  and  upon  localization  of  sound 
sources  in  space.  Studies  describing  the  effects  of  exposure 
upon  the  masked  threshold  (for  speech  let  us  say),  on  Just 
noticeable  differences  or  on  temporal  auditory  resolving  power 
are  still  lacking.  There  are,  however,  a few  not  too  sys- 
tematic observations  on  subjective  noises  (such  as  tinnitus) 
and  dlplacusis  as  after-effects  of  exposure. 

Having  considered  in  some  detail  the  parameters  of 
exposure  and  recovery,  we  must  still  give  an  adequate  descrip- 
tion of  the  condition  of  the  individual  (or  the  group  of 
individuals)  who  has  been  exposed  and  whose  recovery  is  being 
tested.  For  example,  we  must  find  out  how  normal  our  subjects' 
audiograms  were  at  the  start  of  the  exposure . We  should 
require  a complete  otologic  examination  of  our  subjects  and 
obtain  if  possible  a diagnosis  and  classification  of  any  per- 
manent hearing  losses  with  which  they  are  afflicted.  We  should 
know  the  age  of  our  subjects  since  there  are  some  indications 
that  different  age  groups  are  not  equally  sensitive  to  exposure 


The  so-called  recruitment  phenomenon  in  nerve 
deafened  patients  (see  definition  in  Ch.  15 J 
see  also  Sec.  l6.3)  has  attracted  a good  deal 
of  attention. 
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Next,  we  might  want  to  inquire  into  our  subjects' 
previous  history  of  exposure  and  even  into  their  heredity 
from  the  point  of  view  of  auditory  defects . If  data  exist 
showing  the  subjects'  previous  recovery  functions,  we  should 
want  to  know  if  they  were  shown  to  be  particularly  sensitive, 
if  previous  exposures  have  given  rise  to  cumulative  losses, 
if,  in  particular,  at  the  beginning  of  the  exposure  the  sub- 
jects ' hearing  was  normal  compared  to  their  own  standard 
(for  example,  some  subjects  might  be  affected  by  a cold).  We 
should  also  like  to  know  something  concerning  the  effective- 
ness of  the  subjects'  middle  ear  muscles,  but  unfortunately 
there  is  no  good  test  available  at  present  that  will  give  us 
such  information. 

The  Effects  of  Exposure.  The  literature  on  the  broad 
subject  of  loss  of  auditory  function  after  exposure  to  sound 
is  inordinately  large  (see  Ref.  4l/) . There  are  literally 
hundreds  of  articles  dealing  with  observations  made  on  animals, 
with  reports  of  clinical  cases,  with  laboratory  experiments  on 
man  and  with  sweeping  (though  .mostly  not  too  well  controlled) 
industrial  surveys . Furthermore  the  period  after  World  War 
II  has  seen  the  publication  of  at  least  four  Important  mono- 
graphs in  this  general  area:  Ruedl  and  Purrer  42/,  Davis  et 

al  43/,  Thellgaard  44/  and  Hood  43/.  Then  there  are  the  series 
of  important  articles  by  Luscher  and  Zwlslockl  46,47,48/,  the 
Munson  and  Gardner  investigation  of  "loudness  patterns*',  the 
Gauss  and  Chavasse  study  50/,  and  the  series  of  reports  on 
short  duration  auditory  fatigue  by  J.  D.  Harris  and  his  group 
51,52,33,34/;  all  of  these  papers  are  related  in  some  sense  to 
the  de  Mare's  monograph  55/  whose  pioneering  Influence  has 
been  felt  now  for  almost  15  years . 

In  view  of  the  space  that  we  have  available  here,  it 
would  be  presumptlous  to  review  critically  the  findings  of 
so  many  experiments  and  surveys.  This  is  especially  true  since, 
according  to  our  present  impression,  no  overall  valid  generaliz- 
ations could  be  expected  to  derive  from  such  an  undertaking. 

The  various  studies  we  mentioned  (and  others  too  numerous  to 
mention)  have  produced  a large  accumulation  of  data;  unfortunately 
most  of  these  data  are  not  strictly  comparable  since  they  have 
not  been  collected  under  anything  like  comparable  conditions 
and  since  only  very  few  of  the  data  overlap.  However,  these 
various  studies  have  posed  serious  problems  and  it  is  essentially 
with  some  of  these  problems  that  we  shall  be  concerned. 

Our  major  concern  here  is  with  the  type  of  permanent 
perceptive  or  nerve  deafness  that  represents  a significant 
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change  in  the  normal  presbycousls  pattern.  This  type  of  high 
frequency  hearing  loss  is  found  in  many  noisy  occupations 
(including  military  services)  in  which  humans  are  exposed  to 
intense  noise  repeatedly  and  for  considerable  periods  of  time . 
However,  our  Interest  extends  beyond  the  causal  factors  of 
permanent  partial  deafness.  We  should  like  to  understand  its 
genesis,  its  relation  to  temporary  threshold  shifts,  its  rela- 
tion to  the  intensity  level,  the  spectrum  and  the  time  course 
of  exposure  - the  three  most  Important  parameters  of  exposure . 

We  should  also  like  a better  understanding  of  the 
following  questions:  Are  there  cumulative  effects  of  exposure 

even  after  the  threshold  has  returned  to  normal?  Are  there 
certain  age  groups  that  are  particularly  vulnerable  to  exposure? 
What  about  the  sensitivity  to  noise  of  people  who  already  have 
a certain  amount  of  nerve  deafness?  Finally,  we  should  attempt 
to  gain  some  insight  into  the  concomitants  of  variability  in 
individual  susceptibility  to  exposure,  if  for  no  other  reason 
than  to  be  able  to  design  so-called  predictive  tests  separating 
"tough  ears"  from  "tender  ears". 

It  is  hardly  realistic  to  tackle  even  half  of  the  ques- 
tions that  we  have  raised.  Let  us  get  back  to  a question  we 
posed  earlier:  What  happens  in  the  wake  of  any  acoustic 

stimulus? 

For  a partial  answer  to  this  question,  we  go  to  laboratory 
studies  practically  all  of  which  employ  pure  tones  as  exposure 
stimuli  and  either  pure  tones  or  short  pure  tones  (so-called 
tone  pips)  as  test  stimuli  for  the  recovery.  At  levels  of  the 
exposure  stimulus  that  are  not  too  far  above  the  subject's 
threshold  there  occurs  a threshold  shift  at  the  exposure  fre- 
quency and  at  neighboring  frequencies.  This  threshold  shift 
persists  for  a fraction  of  a second  and  is  reasonably  inde- 
pendent of  the  duration  of  the  exposure.  The  initial  threshold 
shift  is,  however,  rather  closely  related  to  the  sensation 
level  of  the  exposure  stimulus  (see  Fig.  17.I).  The  spread  of 
frequencies  for  which  a threshold  shift  occurs  is  reasonably 
symmetrical  at  these  low  intensities  of  the  exposure  stimulus. 
There  is  some  disagreement  as  to  whether  these  threshold  shifts 
are  equal  in  magnitude  at  low  and  at  high  frequencies.  Most 
of  the  argument  revolves  around  the  way  in  which  the  exposure 
stimuli  ought  to  be  described:  Should  we  compare  recovery  of 

the  threshold  for  equal  SPL's,  equal  sensation  levels  or  equal 
loudness  levels? 
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As  we  Increase  the  intensity  of  the  exposure  stimulus, 
the  recovery  process  lengthens . This  lengthening  is  apparently 
accompanied  by  two  features:  (a)  recovery  at  low  frequencies 

seems  to  occur  quicker  than  at  the  higher  frequencies;  (b)  the 
threshold  shift  now  becomes  somewhat  less  symmetrical . In  this 
respect  the  pattern  follows  masking  contours  for  strong  stimuli 
with  spread  of  the  effect  on  the  high  frequency  side. 

Once  a certain  stimulus  Intensity  has  been  exceeded,  the 
threshold  shift  for  a given  recovery  interval  seems  to  become 
sensitive  to  the  duration  of  exposure  (see  Fig.  17.1). 

As  we  Increase  both  Intensity  and  duration  of  the  exposure 
stimulus  (we  are  now  dealing  with  seconds  or  minutes  and  not 
with  fractions  of  seconds)  further,  the  threshold  shift  at  a 
given  recovery  Interval  increases  suddenly  (see  Pig.  17. 2A). 

The  bend  in  the  curve  does  not  seem  to  occur  at  the  same 
stimulus  intensity  on  even  the  same  sensation  level  for  all 
subjects  but  it  seems  to  make  Its  appearance  somewhere  in  the 
vicinity  of  90  db  sensation  level.  For  such  stimulus  intensities 
the  .pattern  of  the  pure -tone  threshold  as  a function  of  fre- 
quency changes  in  character.  While  previously  (for  very  short 
recovery  Intervals)  the  maximum  effect  was  found  at  the  frequency 
of  exposure,  the  maximum  threshold  shift  for  considerably  longer 
recovery  Intervals  has  shifted  upward  In  frequency. 

This  characteristic  asymmetrical  pattern  with  practically 
no  loss  at  frequencies  lower  than  the  exposure  frequency  (some- 
times accompanied  by  sensitization  at  the  exposure  frequency 
itself)  has  been  dignified  by  the  name  of  "half -octave  low". 
According  to  this  rule,  the  greatest  hearing  loss  occurs  at 
about  1/2  octave  above  the  exposure  frequency;  hearing  loss  may, 
however,  extend  as  much  as  3 or  even  4 octaves  above  the  expo- 
sure frequency.  The  fact  that  at  some  time  during  recovery 
the  maximum  loss  Is  found  1/2  octave  above  the  exposure  fre- 
quency does  not  Imply  that  It  will  In  the  further  course  of 
the  recovery  remain  tied  to  this  frequency. 


Figure  17-1 

Effect  of  duration  of  exposure  tone  (500  cps)  upon 
the  threshold  shift  of  a 1000  cps  tone  pip  of  dura- 
tion 30  milliseconds.  The  silent  Interval  between 
tones  is  20  milliseconds.  The  figures  to  the  right 
of  the  curves  are  the  parametric  values  of  the  sensa- 
tion level  of  the  exposure  tone.  (After  Rawnsley 
and  Harris  53/) • 
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What  happens  as  we  further  Increase  the  SPL  of  our 
exposure  stimulus  as  well  as  its  duration?  Laboratory  experi- 
mentation tells  us  of  the  effects  of  exposures  up  to  l40  db 
SPL  at  durations  up  to  about  one  hour.  Most  of  the  data  are 
to  be  found  in  the  Ruedl  and  Furrer  42/  and  Davis  et  al  43/ 
monographs.  As  we  might  anticipate,  such  exposures  result  in 
severe  temporary  hearing  losses  from  which  subjects  recover 
after  a few  hours,  days  or  weeks. 

Where,  will  the  reader  ask,  is  the  point  of  no  return, 
l.e.,  where  will  the  Induced  hearing  loss  no  longer  remain 
entirely  temporary?  For  rather  obvious  reasons  we  do  not  have 
the  answer  to  this  question.  Few  are  the  subjects  who  can 
testify  to  the  fact  that  Irreversible  threshold  shifts  can  be 
acquired  in  the  laboratory.  Responsible  experimenters  watch 
out  for  such  things.  But  even  if  a few  such  cases  occurred, 
who  could  tell  which  of  a series  of  exposures  broke  the  inner 
ear's  back?  Cumulative  effects  should  certainly  not  be  over- 
looked in  such  a situation. 

It  is  almost  intuitively  clear  that  there  exists  some- 
thing like  an  elastic  limit,  l.e.,  a SPL  that  once  exceeded 
produces  Irreversible  changes  in  a subject's  ears.  For  man 
data  are  scarce,  but  the  region  is  apparently  somewhere  above 
155  db.  For  anesthetized  animals  it  may  be  closer  to  150  db 
SPL. 


Figure  17.2 

A.  Threshold  shifts  as  a function  of  stimulus  inten- 
sity after  a one  minute  exposure  to  a 2048  cps  tone. 
Data  points  represent  averages  for  three  subjects. 
Recovery  is  tested  at  the  frequency  of  the  expo- 
sure stimulus. 

B.  Threshold  shifts  as  a function  of  the  duration  of 
exposure  for  four  different  subjects.  Exposure 
tone  and  test  tone  again  have  the  same  frequency 
(2048  cps).  The  sensation  level  of  the  exposure 
tone  was  100  db . (After  Hood  43/) . 
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While  we  can  conceive  of  such  a limit  as  something 
almost  almost  Independent  of  the  duration  of  the  exposure, 
we  are  at  an  almost  complete  loss  when  asked  to  specify  what 
kind  of  an  intensity-time  relation  we  suggest  for  an  "equln- 
oxlous”  contour.  We  feel  that  we  are  safe  in  saying  that  the 
higher  the  SPL  (below  the  critical  limit)  the  shorter  the 
exposure  time  necessary  to  produce  Irreversible  effects.  We 
have  little  faith  that  a linear  relation  for  such  effects  will 
exist  as  is  shown  in  Fig.  17.2B  for  reversible  effects..  As  a 
matter  of  fact,  other  authors,  using  average  hearing  loss 
throughout  a certain  frequency  range  as  an  Indicator  of  the 
temporary  damage,  disagree  with  the  findings  represented  by 
17 .2B.  Too  many  uncontrolled  factors  exist  in  the  picture 
of  the  development  of  permanent  partial  deafness  to  permit  us 
to  apply  a simple  I x t rule . 

The  question  is  often  asked:  What  is  the  relation  between 

the  temporary  (reversible)  losses  that  can  be  produced  in  the 
laboratory  and  the  permanent  (irreversible)  effects  observed 
in  Industry,  for  Instance?  Is  the  same  process  at  work  in  the 
two  situations?  The  preceding  description  of  the  effects 
of  exposure  emphasized  both  the  continuity  of  the  phenomena 
and  the  changing  character  as  the  exposures  grew  more  severe . 

We  shall  refrain  from  making  the  superficially  simple 
assumption  that  all  throughout  the  auditory  range  there  is 
only  a single  exposure-recovery  process  at  work.  We  shall 
also  refrain  from  making  assumptions  as  to  where  in  the  inner 
ear  this  single  process  is  to  be  found.  We  need  a good  deal 
more  information  before  such  queries  can  be  answered  meaning- 
fully. The  mere  fact  that  inter-observer  variability  in 
threshold  shift  Increases  as  exposures  become  more  severe, 
added  to  the  fact  that  different  observers  recover  very 
differently  from  what  is  apparently  the  same  loss.  Invites 
caution . 

In  view  of  the  variability  among  individuals  and  the 
great  number  of  uncontrolled  factors,  the  whole  matter  should 
be  treated  statistically.  In  Sec.  l8.4  we  have  attempted  an 
empirical  approach  tc  this  problem  by  trying  to  establish  a 
damage  risk  criterion  for  the  control  of  noise. 

Another  gap  in  our  knowledge  needs  to  be  filled  before 
we  tackle  the  problem  of  the  relation  of  reversible  to 
irreversible  threshold  shifts.  Most  of  the  studies  of 
temporary  loss  used  pure  tones  as  exposure  stimuli.  The 
few  data  on  exposure  to  noise  (Davis  et  al  43/j  Postman  and 
Egan  36/)  or  to  complex  sounds  (Ruedl  and  Furrer  42/)  do  not 
provide  enough  continuity  between  laboratory  experiments  and 
actual  exposures  in  Industry. 
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On  several  occasions  Vue  have  referred  to  the  truly- 
amazing  amount  of  variability  that  seems  to  exist  among  indi- 
viduals with  respect  to  both  permanent  and  temporary  losses. 
Audiometric  examinations  of  workers  in  noisy  industries  show 
time  after  time  that,  out  of  a group  of  men  who  have  apparently 
been  exposed  to  the  same  noise,  some  will  show  marked  symptoms 
of  deafness,  while  the  hearing  of  others  will  remain  practically 
normal,  even  after  conditions  of  severe  exposure.  There  is  at 
present  no  good  rationale  for  these  facts.  They  Just  have  to 
be  accepted. 

Peyser  57/t  Wilson  ^8/,  and  more  recently  Thellgaard  40/ 
have  tried  to  design  predictive  tests  on  the  basis  of  which 
noise-susceptible  individuals  could  be  separated  from  others. 
Thellgaard 's  results  represent  the  most  comprehensive  investi- 
gation with  pure  tones  as  exposure  stimuli.  Unfortunately, 
his  results  are  not  encouraging. 

Thellgaard  finds  that  not  only  is  there  variability 
between  subjects,  and  between  exposures  for  a single  subject, 
but  there  is  also  striking  variability  between  the  effects  of 
exposure  upon  the  two  ears  of  the  same  subject.  (Wheeler  59/ 
confirms  this  finding.)  There  is  even  an  indication  that 
susceptibility  to  noise-induced  hearing  loss  may  change  with 
frequency  in  the  same  individual.  Thellgaard  concludes  that 
there  is  no  reason  to  assume  that  a generally  valid  prediction 
for  a person’s  susceptibility  to  noise  can  be  based  upon  the 
amount  of  temporary  loss  found  at  one  frequency. 

Wheeler  59/>  in  this  country,  has  also  worked  on  the 
problem  of  the  desirable  characteristics  for  a practical 
predictive  test.  Figure  17.3  shows  some  of  the  results  he 
obtained.  Wheeler  expresses  his  preference  for  a complex  noise 
as  the  exposure  stimulus.  He  suggests  that  the  test  frequencies 
be  confined  to  the  high  frequency  region,  and  that  the  criteria 
of  susceptibility  be  based  upon  both  initial  threshold  shifts 
and  rate  of  recovery.  Wheeler  points  out  that  even  if  we 
decided  today  upon  a standardized  predictive  test  we  would 
not  know  how  far  we  could  really  trust  it . The  validation 
procedure  would  require  years  of  follow-up  studies  in  industry. 
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Figure  17.3 

A.  Initial  threshold  shifts  at  different  frequencies,  ob- 
served after  exposure  to  30  minutes  of  thermal  noise 
at  105  db  SPL.  The  exposure  stimulus  was  delivered 
over  a loudspeaker.  Each  bar  represents  the  mean  of 
three  exposures  for  a different  subject. 

B,  Recovery  curves  for  nine  subjects  (three  at  each 
frequency) . Open  circles  are  threshold  shifts  for 
right  ears,  while  solid  circles  indicate  threshold 
shifts  for  left  ears.  Each  experimental  point  Is  the 
mean  of  nine  post-exposure  thresholds.  (After  Wheeler 
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CHAPTER  18 


HUMAN  RESPONSE  CRITERIA  FOR  NOISE  CONTROL 


l8 .1  Introduction;  The  Establishment  of  Engineering 
Criteria  for  Noise  Control 


In  the  preceding  chapters  we  have  tried  to  assess  the 
effect  of  sound  on  various  aspects  of  man's  behavior.  In 
Chapters  l6  and  17  we  have  been  concerned  with  both  psycho- 
logical and  physiological  functions . We  have  seen  how  only 
too  often  we  are  rather  far  from  a complete  understanding 
of  the  Interaction  between  sound  and  man.  In  some  areas 
the  basic  Information  Is  available  and  we  have  only  to  plug 
In  particular  values*  In  order  to  predict  just  how  man's 
actions  will  be  affected.  In  other  areas,  particularly  those 
concerned  with  the  effect  of  what  we  might  call  "very  Intense 
sound  Indeed,"  we  are  relatively  Ignorant  of  the  basic  non- 
audltory  phenomena.  There  Is  some  hope,  however,  that  the 
next  few  years  will  witness  an  accumulation  of  reliable  data 
In  this  field  whose  Importance  from  the  viewpoint  of  military 
effectiveness  Is  scarcely  In  doubt. 

We  have  also  tried  to  point  out  that  even  In  the  areas 
in  which  the  basic  Information  Is  available  there  Is  still 
often  considerable  variability  among  individuals,  with  the  ex- 
ception perhaps  of  the  problems  involving  speech  communication. 
In  many  instances  we  are  unable  to  predict  how  a given  sound 
will  affect  a particular  Individual . This  means  that  we  are 
not  yet  ready  to  select  people  for  a given  job  in  a manner 
that  will  realistically  take  Into  account  their  Individual 
susceptibility  to  sound. 

But  when  all  these  warnings  have  been  given,  when  all  the 
qualifications  have  been  put  down  on  paper,  the  fact  remains 
that  people  have  to  make  decisions  In  this  area  that  Involves 
the  Interaction  of  noise  and  man.  Some  people  have  the  re- 
sponsibility for  deciding  where  a given  airfield  is  to  be 
located;  others  have  to  make  sure  that  speech  communication 


An  example  Is  a situation  in  which  a specifiable 
masking  noise  interferes  with  the  reception  of  a 
specifiable  sound.  Figure  16.17  summarizes  one 
of  the  Instances  for  which  a satisfactory  generaliza- 
tion or- computational  scheme  exists. 
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will  be  possible  at  a certain  distance  from  a Jet  engine  run- 
ning at  full  power;  still  others  have  to  persuade  some  of  their 
fellow  workers  of  the  necessity  of  wearing  ear  protection.  This 
dilemma  could  be  paraphrased  - with  apologies  to  Emerson  - 
"When  theory  whispers  low,  'you  can't',  practice  replies,  'I 
must  1 ' " 

Under  such  circumstances  there  Is  a temptation  to  base 
all  Judgments  upon  a single  criterion  (overall  sound  pressure 
level,  or  loudness,  for  Instance)  and  to  make  decisions  that 
one  would  assume  to  be  at  least  consistent  If  nothing  else. 
Unfortunately  there  are  no  single  magic  numbers  that  permit  us 
to  arrive  at  policy  decisions  with  respect  to  noise  control . 

As  we  have  seen  In  our  previous  discussion,  "noise"  Is 
basically  unwanted  sound,  l.e.,  sound  that  Interferes  with  an 
activity  that  we  want  to  carry  on.  However,  that  Interference 
has  to  be  specified.  A given  sound  Is  not  equally  effective 
In  masking  speech.  In  Interfering  with  sleep,  or  In  producing 
permanent  hearing  loss. 

In  recent  years  there  has  been  a growing  tendency  to 
assess  the  effect  of  a noise  by  Its  loudness.  We  have  seen 
In  Sec.  16.2  that  It  Is  by  no  means  simple  to  arrive  at  loud- 
ness Judgments  for  sounds  having  a complex  spectrum  and  a 
rather  Irregular  character  In  time.  But  let  us  assume  that 
by  some  carefully  worked  out  procedure  we  should  be  able  to 
overcome  these  difficulties.  We  will  still  be  faced  with  the 
fact  that  loudness  functions  are  not  related  In  the  same  way 
to  such  functions  as  masking,  hearing  loss,  or  even  speech 
Intelligibility. 

Your  Intuitive  feeling  may  be  that  all  you  really  object 
to  when  you  hear  a noise  Is  Its  loudness.  In  the  face  of  this 
quite  common  reaction  let  us  make  our  position  more  precise . 
Given  a noise  that  Interferes  with  speech  communication  In  a 
quantifiable  way,  the  same  noise  made  louder  (or  more  Intense) 
will  Interfere  more  with  speech  communication.  In  other  words, 
as  long  as  we  are  not  changing  noises  or  human  functions  In 
mid-stream  we  can  assess  the  Interference  caused  by  a noise 
by  quantifying  It  as  a function  of  Its  loudness.  If,  however, 
we  are  Interested  In  hearing  loss  after  exposure  to  the  noise 
and  not  In  masking  of  speech  we  can  not  offhand  assume  that 
we  are  still  dealing  with  the  same  function.  And  If  we  are 
concerned  with  a different  noise,  we  have  to  make  sure  that  the 
old  noise  and  the  new  noise  can  be  Judged  on  the  same  loudness 
scale  without  obtaining  paradoxical  results. 
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It  Is  apparent,  then,  that  we  have  to  establish  about 
as  many  sets  of  criteria  as  there  are  human  responses  in 
whose  performance  we  are  Interested.  And  that  we  have  to 
specify  in  a rather  detailed  fashion  the  noise  that  Is  inter- 
fering with  a given  response.  Let  us  not  give  up  hope  that 
in  the  course  of  time  we  shall  be  able  to  generalize  our  re- 
sults. At  present  let  us  aim  toward  using  the  criteria  we 
have  suggested  as  framework  in  which  data  can  be  usefully 
collected  and  evaluated. 

We  shall  be  concerned  primarily  with  the  interference 
of  noise  with  three  general  types  of  response  or  behavior. 

People  want  (l)  to  live  in  their  homes  without  being  too  much 
disturbed  by  noise;  (2)  to  communicate  with  each  other  by 
speech,  and,  (3)  to  protect  their  ability  to  hear.  There  are, 
of  course,  situations  in  which  the  response  is  not  easily  broken 
down  into  definite  units  of  behavior.  In  such  instances,  the 
setting  of  criteria  becomes  singularly  difficult. 

18.2  Criteria  for  Residential  Living 

Introduction . In  and  around  our  homes  we  are  constantly 
exposed  to  noise  that  originates  from  sources  both  indoors  and 
outdoors . The  level  of  the  noise  is  often  quite  low,  but  it 
is  rarely  inaudible,  even  in  the  most  remote  suburban  areas. 
Frequently  the  sounds  are  virtually  unnoticed,  i.e.,  we  hardly 
react  to  them,  and  we  accept  them  as  part  of  our  environment . 

The  noise  from  distant  traffic,  the  occasional  passing  of  an 
aircraft  at  a distance  of  5OOO  ft  or  more  and  the  noise  of 
falling  rain,  rustling  leaves  and  crickets  all  belong  to  the 
general  category  of  "ambient  background  noise"  for  a large  ma- 
jority of  people.  Frequently,  however,  a noise  that  is 
unwanted  or  annoying,  to  some  residents  at  least,-  intrudes 
through  the  background  noise . Acoustical  engineers  are  often 
faced  with  the  problem  of  predicting  the  probable  neighborhood 
reaction  to  the  noise  on  the  basis  of  physical  measurements  of 
the  noise.  Their  task  may  be  to  prescribe  measures  for  reduc- 
tion or  control  of  the  noise  at  the  source  so  that  it  does  not 
give  rise  to  expressions  of  annoyance  on  the  part  of  the 
residents,  or  so  that  it  is  at  least  tolerated  by  them. 

There  are  essentially  two  aspects  tp  the  analysis  of  the 
interaction  between  an  intruding  noise  and  human  beings  exposed 
to  the  noise . The  psychologist  would  call  these  aspects  stim- 
ulus and  response . The  stimulus  function  can,  as  a first 
approximation,  be  defined  by  a physical  description  of  the 
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noise  to  which  the  human  beings  are  exposed.  It  may,  however, 
also  be  necessary  to  describe  the  physical  characteristics  of 
noises  to  which  a particular  group  of  people  have  been  exposed 
in  the  past,  in  order  to  evaluate  the  degree  to  which  they 
have  become  adjusted  to  a noisy  environment.  The  response  of 
residents  is  measured  through  expressions  of  annoyance,  com- 
plaints or  even  through  legal  action.  Our  task  here  is  to  be 
able  to  predict  their  response  from  knowledge  of  the  stimulus 
function,  past  and  present. 

In  all  questions  involving  the  interaction  of  noise  and 
man  we  must  be  careful  to  state  the  type  of  behavior  that  is 
Involved.  In  the  present  analysis  it  has  been  assumed  that 
the  intruding  noise  Interferes  with  a generalized  class  of 
behavior  that  may  be  described  as  "residential  living."  The 
scheme  cannot  be  applied  directly  to  cases  that  Involve  noise 
in  the  vicinity  of  hospitals  or  that  primarily  Involve  inter- 
ference with  speech  communication  (in  the  classroom  of  a 
school  in  a residential  area,  for  example).  Interference  with 
speech,  sleep  and  various  other  occupations  and  avocations 
(such  as  listening  to  music)  is  Involved  only  in  the  sense  that 
such  functions  are  part  of  residential  living. 

Ideally,  an  adequate  description  of  the  stimulus  should 
provide  a quantitative  specification  of  all  characteristics 
that  are  likely  to  influence  response  behavior.  It  is  clear 
that  many  variables  are  involved.  Two  Important  variables  are 
the  overall  sound  pressure  level  and  the  frequency  spectrum 
of  the  intruding  noise . Other  typical  factors  influencing 
the  response  are  the  repetitive  character  of  the  noise,  the 
time  of  day  or  night-  at  which  the  noise  occurs,  the  history 
of  the  previous  exposure  to  noise  and  the  origin  of  the  noise . 
Often  certain  aspects  of  the  stimulus  are  best  described  in 
statistical  terms,  since  some  of  the  variables  Involved  may 
fluctuate  over  a range  of  values . 

In  the  following  analysis,  a scheme  is  proposed  to  per- 
mit the  evaluation  and  correlation  of  a certain  number  of 
relevant  data  relating  to  both  stimulus  and  response . The 
scheme  results  In  a specification  of  the  stimulus  in  terms 
of  a noise  rating,  a single  number  that  should  enable  the 
acoustic  specialist  to  predict  the  statistical  range  of  neigh- 
borhood reaction  to  the  noise . Case  histories  and  field  data 
involving  neighborhood  reactions  to  noise  are  cited  as  Justifi- 
cation for  the  proposed  scheme . 
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It  must  be  emphasized  that  the  scheme  presented  here 
Is  largely  empirical,  and  that  it  should  be  regarded  as  one 
of  the  several  possible  frameworks  within  which  data  from 
case  histories  could  be  correlated.  It  Is  clear  that  there 
are  many  limitations,  and  the  procedure  is  subject  to  revision 
and  refinement  as  further  data  become  available. 

In  effect,  the  proposed  framework  provides  us  with  a 
set  of  criteria  for  the  control  of  noise  in  residential  areas. 
In  the  establishment  of  such  criteria,  a decision  must  first 
be  made  concerning  the  acceptable  level  of  complaints  from 
the  community.  Then  the  empirical  scheme  to  be  described  in- 
dicates to  what  extent  the  noise  must  be  reduced  or  modified 
in  order  that  the  complaints  be  kept  below  this  acceptable 
level . 

The  Noise  Rating.  A study  of  evidence  that  has 
accumulated  over  the  years  has  led  us  to  the  conclusion  that 
there  are  at  least  seven  largely  independent  characteristics 
of  a noise  stimulus  that  control  the  response  behavior  of  a 
community  exposed  to  the  noise . The  role  played  by  each  of 
these  seven  variables  in  contributing  to  a final  evaluation 
of  a so-called  noise  rating  is  discussed  in  the  following 
paragraphs . In  general  the  noise  rating  of  a stimulus  deter- 
mines the  statistical  range  of  responses  to  the  stimulus.  In 
addition  we  shall  discuss  certain  other  factors  that  are  likely 
to  modify  the  response  of  a neighborhood  but  which  are  usually 
Important  only  in  rather  special  or  unusual  situations . 

Level  Rank.  The  overall  level  and  the  spectrum  of  the 
noise  are  of  primary  Importance  in  the  determination  of  the 
noise  rating.  In  the  present  analysis,  the  spectra  are  assumed 
to  be  given  as  sound  pressure  levels  in  octave  frequency  bands. 
It  is  assumed  that  the  values  are  obtained  by  averaging  over 
a reasonable  length  of  time  (similar  to  the  averaging  performed 
by  a standard  sound  level  meter)  and  over  a reasonable  number 
of  locations  in  the  community.  Thus  Instantaneous  peaks  of 
sound  pressure  and  levels  in  localized  regions  of  somewhat 
higher  or  lower  intensity  than  the  average  are  smoothed  out . 

The  spectra  may  be  obtained;  (a)  by  direct  measurements  with 
a sound  level  meter  and  octave  band  analyzer,  or  (b)  by  calcu- 
lations derived  from  engineering  data  on  noise  sources  and 
transmission  properties  of  the  relevant  media.  Techniques  of 
measurement  and  data  from  which  one  can  perform  such  calcula- 
tions are  discussed  in  Volume  I. 


WADC  TR  52-204 


183 


Figure  18.1  shows  a family  of  curves  that  define  the 
"level  rank."  The  ranks  are  designated  by  the  letters  from 
A to  M.  We  are  using  the  letters  of  the  alphabet  as  an 
appropriate  rank  order  scale  for  our  level  ranks . This  should 
make  definitely  clear  that  D Is  a higher  rank  than  B,  since  D 
comes  later  in  the  alphabet.  Had  we  chosen  the  natural  numbers 
to  designate  our  level  rank,  one  might  be  tempted  to  assume 
that  a noise  having  a level  rank  of  4 was  twice  as  annoying 
as  a noise  having  the  level  rank  of  2.  This  Is  precisely  what 
we  wanted  to  avoid.  We  feel  that  all  we  can  specify  is  rank 
order  plus,  perhaps,  a crude  scale  of  equal  Intervals,  each 
of  them  having  a value  of  approximately  5 db. 

Each  rank  Is  associated  with  the  area  between  two  neigh- 
boring curves.  At  the  low  end  Is  rank  A,  and  the  lower  boundary 
of  this  rank  is  a smoothed  curve  representing  the  average  thres- 
hold of  hearing  (free  field).  The  highest  rank  Is  M and  the 
upper  boundary  of  this  rank  approaches  the  currently  used 
criterion  of  damage  to  the  hearing  mechanism  under  continuous 
exposure.  (See  Sec.  l8.4).  This  choice  of  scale  Implies  that 
a noise  that  Is  Inaudible  does  not  contribute  to  annoyance 
and  that  a noise  that  may  produce  permanent  damage  to  the  un- 
protected ear  would  In  any  case  be  treated  as  socially  unaccept- 
able whether  or  not  questions  of  annoyance  are  Involved. 

The  measured  or  calculated  octave  band  spectrum  of  the 
Intruding  noise  Is  to  be  superimposed  on  Fig.  18.1.  The  level 
rank  of  the  noise  spectrum  Is  given  by  the  highest  area  into 
which  the  spectrum  intrudes  In  any  band.  This  procedure  Implies 
that  the  noise  level  In  a single  band  (the  one  projecting  Into 
the  highest  area  In  Fig.  l8.1)  can  In  some  cases  determine  the 
level  rank  uniquely.  Essentially,  the  assumption  Is  that  dif- 
ferent frequency  bands  contribute  Independently  to  the  shaping 
of  the  response,  and  that  the  effects  of  different  bands  are 


Figure  18.1 

Family  of  curves  used  to  determine  the  level  rank 
for  residential  noise.  The  spectrum  of  the  noise 
Is  plotted  as  sound  pressure  levels  in  octave  bands 
of  frequency.  The  highest  zone  into  which  the 
spectrum  protrudes  Is  designated  as  the  level  rank 
of  the  noise . 
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not  additive.  At  present,  no  laboratory  data  are  available 
for  substantiation  of  this  assumption.  However,  we  shall 
note  later  that  the  case  histories  provide  data  that  are  not 
inconsistent  with  the  above  assumption,  and  for  the  present 
this  is  considered  as  adequate  Justification. 

The  Justification  for  selecting  the  particular  curves 
shown  in  Pig.  18.1  lies  primarily  in  the  evidence  obtained 
from  the  case  histories  cited  later  in  this  section.  A few 
reltiarks  are  pertinent  at  this  point,  however.  There  is 
reason  to  believe  that  the  loudness*  of  a given  band  of 
noise  is  correlated  to  some  extent  with  the  annoyance  it  pro- 
duces. Using  250-mel  bands  of  noise  as  stimuli  in  laboratory 
experiments  Kryter  1,2/  and  Pollack  3/  have  shown  that  subjects 
tend  to  Judge  equally  loud  stimuli  to  be  equally  annoying,  ex- 
cept at  very  high  and  low  frequencies.  It  happens  that  the 
boundary  lines  between  ranks  in  Fig.  I8.I  represent  noise 
spectra  which  have  approximately  equal  loudness  in  equal  mel 
bands,  assuming  free-field  listening.  Loudness  data  for 
continuous  spectrum  noise  do  not  extend  below  100  cps . The 
shape  of  the  curves  at  low  frequencies  represents  an  empirical 
extrapolation.  Thus  some  Justification  for  the  shape  of  the 
curves  is  provided  in  addition  to  evidence  from  field  data. 

The  distance  between  neighboring  curves  in  the  mid-  and 
high  frequency  range  is  approximately  5 dh . The  steps  are 
slightly  smaller  at  low  Intensities  and  somewhat  larger  at 
high  intensities.  At  low  frequencies  the  spacing  between 
curves  is  decreased,  as  the  bunching  of  the  equal  loudness 
contours  for  bands  of  noise  at  low  frequencies  would  suggest. 
Several  considerations  dictate  the  selection  of  5 db  as  a 
suitable  discrete  step  for  the  specifications  of  level  rank. 
From  previous  experience,  it  is  believed  that  the  range  of 
variation  normally  encountered  in  the  reactions  of  the  resi- 
dents of  a community  to  a given  noise  is  sufficiently  wide 
that  a change  of  noise  level  of  less  than  5 db  would  not  pro- 
duce a significant  change  in  the  general  pattern  of  reaction 
to  the  noise.  In  many  situations,  also,  the  fluctuations  of 
the  noise  levels  both  in  time  and  in  space  within  a community 
are  as  high  as  5 db,  and  it  is  unrealistic,  therefore,  to 
specify  the  levels  with  greater  precision. 

Correction  Numbers . Once  the  level  rank  has  been 
established  by  the  procedure  outlined  above,  attributes  of 


See  Section  l6.2 
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the  noise  other  than  the  average  level  and  the  spectrum  must 
be  evaluated  in  order  to  arrive  at  the  final  noise  rating. 

This  evaluation  takes  the  form  of  correction  numbers.  These 
correction  numbers  are  the  number  of  ranks  by  which  level 
rank  Is  increased  or  decreased.  The  final  result  of  this 
calculation  is  the  composite  noise  rating.  The  evaluation  of 
the  correction  numbers  Is  discussed  In  the  following  sections. 

A summary  of  all  correction  numbers  Is  presented  In  tabular 
form  following  the  discussion. 

1 . Spectrum  Character.  A noise  spectrum  that  contains 
audible  pure-tone  or  single-frequency  components  appears  to 

be  more  annoying  than  a spectrum  that  is  reasonably  continuous. 
This  deduction  Is  based  upon  engineering  experience  with  noise 
spectra  of  both  types.  If,  for  example,  the  sound  pressure 
level  in  an  octave  band  reaches  the  level  rank  D by  virtue  of 
the  contribution  of  a single  frequency  component,  it  Is  proposed 
that  a rank  correction  of  value  +1  be  applied,  l.e.,  that  the 
level  rank  of  the  noise  be  raised  from  D to  E.  For  a continuous 
spectrum  noise,  no  correction  should  be  applied.  The  implica- 
tion is  that,  all  other  things  being  equal,  the  level  of  a pure 
tone  must  be  about  5 db  below  the  level  of  a continuous  spec- 
trum noise  In  the  same  octave  band  in  order  to  produce  the 
same  neighborhood  reaction. 

2.  Peak  Factor.  A noise  that  Is  reasonably  continuous 
in  time,  at  least  over  periods  of  a few  seconds  or  more,  is 
assumed  to  be  less  annoying  than  an  Impulsive  type  of  noise 
An  example  of  Impulsive  noise  is  the  sound  from  a drop  forge 
or  gun  shots.  In  the  proposed  scheme  a level  rank  correction 
of  value  +1  should  be  applied  to  the  level  rank  If  the  noise 
Is  impulsive,  but  no  correction  should  be  applied  for  a con- 
tinuous noise . At  present  no  firm  definition  of  an  Impulsive 
type  of  noise  in  quantitative  terms  is  proposed,  and  some 
Judgment  is  required  to  distinguish  between  Impulsive  and 
continuous  noise . 

3.  Repetitive  Character.  In  addition  to  the  short-time 
peak  factor  discussed  above,  the  repetitive  character  of  the 
Intruding  noise  influences  the  neighborhood  reaction  to  a large 
degree.  For  example,  a noise  that  lasts  for  20  seconds  during 
every  minute  is  Judged  to  be  more  annoying  than  the  same  noise 
occurring  twice  a day.  A community  will  tolerate  the  occasional 
passage  of  a Jet  aircraft  overhead,  but  frequent  passage  of  si ch 
aircraft  may  lead  to  strong  complaints . Quantitative  evalua- 
tion of  the  Influence  of  repetitive  character  is  not  well  es- 
tablished at  present  because  field  data  are  not  available  for 
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a wide  range  of  conditions.  Assuming  exposures  of  20-30 
seconds  duration  (the  approximate  time  during  which  the 
sound  from  an  aircraft  overhead  can  be  heard).  Table  18.1 
gives  some  preliminary  estimates  of  the  correction  numbers  to 
be  applied  numerically  to  the  level  rank.  For  exposures  of 
different  duration,  a good  deal  of  Judgment  is  necessary  in 
selecting  the  appropriate  correction  number.  No  correction 
IS  to  be  applied  for  a noise  that  is  continuous . 


TABLE  18.1 

CORRECTION  NUMBERS  TO  ACCOUNT  FOR  REPETITIVE  CHARACTER  OF 
NOISE,  20-30  SECS  EXPOSURES  ASSUMED 


1 exposure  per  min  (or  continuous)  0 

10-60  exposures  per  hour  -1 

1-10  exposures  per  hour  -2 

4-20  exposures  per  day  -3 

1-4  exposures  per  day  -4 

1 exposure  per  day  -5 


This  table  is  clearly  very  tentative.  Here,  as  else- 
where, we  do  not  have  the  necessary  number  of  cases  to  tie 
down  the  correction  parameters  the  way  we  have  indicated.  As 
more  case  histories  are  checked  against  this  scheme,  it  will 
turn  out  that  certain  correction  procedures  represent  pretty 
good  guesses  while  others  are  In  need  of  modification. 

4.  Level  of  Background  Noise . The  level  of  background 
noise  Is  usually  lower  In  suburban  communities  than  in  urban 
areas  (see  Ch.  8).  In  the  city  the  average  noise  from  traffic 
is  greater  and,  especially  In  industrial  areas,  there  are  often 
contributions  to  the  noise  level  from  factory  machinery  and 
similar  sources . Residents  In  areas  with  low  background  level 
are  more  likely  to  react  to  intruding  noise  of  fairly  low  level 
than  those  In  areas  in  which  the  background  noise  partially 
masks  the  intruding  noise.  Also,  if  a fairly  high  background 
noise  exists,  residents  become  adjusted  to  the  intrusion  of 
noise,  auid  are  less  inclined  to  complain.  In  a sense/  the 
background  noise  plays  the  role  of  a reference  level  with  re- 
spect to  which  the  intruding  noise  is  measured. 
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On  the  basis  of  measurements  of  the  levels  of  background 
noise  in  various  communities,  and  on  the  basis  of  experience 
with  complaints  in  these  communities,  the  following  correction 
numbers  in  Table  l8.2  are  proposed  to  account  for  the  levels 
of  background  noise.  These  correction  numbers  are  added  to, 
or  subtracted  from,  the  level  rank.  Examples  of  the  approxi- 
mate range  of  background  levels  and  spectra  for  the  various 
communities  are  shown  in  Fig.  l8.2  and  approximate  correction 
numbers  are  noted  on  the  figure . 

If  we  have  a background  noise  spectrum  different  from 
the  contours  of  Pig.  l8.2  some  Judgment  is  required  in  estimat- 
ing the  appropriate  correction  numbers.  In  most  areas,  however, 
spectra  similar  in  shape  to  those  in  Pig.  18.2  will  be  found. 


TABLE  18.2 

CORRECTION  NUMBERS  TO  ACCOUNT  FOR  THE  AMBIENT  NOISE  LEVELS  IN 
THE  NEIGHBORHOOD  ( SEE  ALSO  PIG.  18.2) 


Neighborhood 

Correction  Number 

Very  quiet  suburban 

+1 

Suburban 

0 

Residential  urban 

-1 

Urban  near  some  Industry 

-2 

Area  of  heavy  Industry 

-3 

5.  Time  of  Day.  Most  residents  agree  that  intruding 
noise  is  more  tolerable  in  the  daytime  than  during  the  evening. 
During  the  night,  the  background  noise  levels  from  traffic  and 
other  sources  are  usually  lower  than  the  corresponding  daytime 
levels  and  hence  an  intruding  noise  is  subject  to  less  masking. 
Therefore,  the  noise  is  more  noticeable.  We  estimate  that  a 
correction  number  of  -1  should  be  applied  to  the  level  rank 

if  the  intruding  noise  occurs  only  in  the  daytime . No  correc- 
tion is  applied  for  round-the-clock  operation  or  for  operation 
after,  say,  ten  o'clock  at  night. 

6.  Adjustment  to  Exposure . Experience  has  shown  that 
residents  can  adjust  to  a varying  extent  to  an  Intruding  noise 
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after  repeated  exposures.  For  example,  people  near  a rail- 
road can  become  accustomed  to  the  noise,  even  though  they 
may  be  annoyed  during  the  first  few  days  of  exposure . The 
noise  of  an  occasional  aircraft  overhead  is  now  accepted  by 
most  people,  and  they  are  considered,  therefore,  to  be  adjusted 
to  this  sound.  No  correction  should  be  applied  to  the  level 
rank  If  the  intruding  noise  is  a new  one  to  which  the  residents 
have  not  been  exposed  previously.  If  there  has  been  some 
adjustment  to  the  noise,  a correction  number  of  -1  is  proposed. 
Only  In  extreme  conditions  (such  as  emergency  or  wartime) 
should  a correction  number  of  -2  or  more  be  applied.  We  are 
suggesting,  therefore,  that  adjustment  to  exposure  is  worth 
5 to  10  db. 

7.  Other  Factors . In  addition  to  the  physical  factors 
listed  above,  factors  of  a psychological  nature  Influence 
neighborhood  reaction  to  an  Intruding  noise,  though  by  no  meeins 
to  a constant  degree.  Such  questions  as  connotation  of  the 
noise  and  public  relations  between  the  community  and  those 
causing  the  noise  may  modify  the  noise  rating  to  a marked  ex- 
tent . To  account  for  these  more  subtle  factors  In  a quanti- 
tative fashion  is  a difficult  task. 

For  example,  a statement  In  the  press  to  the  effect  that 
a certain  type  of  noise  is  damaging  to  the  health  of  persons 
exposed  to  the  noise  could  suddenly  lead  to  strong  adverse 
public  reaction,  even  though  the  noise  level  remains  the  same 
and  physical  measurements  of  the  noise  might  even  Indicate  a 
low  noise  rating.  Again,  If  an  Intruding  noise  carries  the 
connotation  of  danger  (for  example,  the  noise  of  a low-flying 
aircraft,  especially  after  some  accidents  have  occurred  nearby) 
residents  are  likely  to  react  more  strongly  than  they  would  to 
a noise  of  the  same  level  but  originating  within  their  own  home 

For  some  residents,  the  source  of  noise  may  be  related 


Figure  18.2 

Family  of  curves  used  to  determine  the  correction 
number  for  background  noise . The  spectrum  of  the 
ambient  background  noise  is  plotted  as  sound  pres- 
sure levels  in  octave  bands  of  frequency.  The  zone 
in  which  the  major  portion  of  the  noise  spectrum 
lies  designates  the  correction  number  to  be  applied 
for  background  noise . 
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to  their  Job.  Under  such  circumstances,  they  may  be  much 
more  inclined  to  tolerate  a certain  noise  level  than  if  they 
considered  the  same  noise  as  thoroughly  unnecessary. 

The  present  scheme  for  estimating  neighborhood  reaction 
to  noise  makes  no  attempt  to  account  for  these  factors  which 
are  only  indirectly  related  to  the  physical  characteristics 
of  the  stimulus.  Such  factors  appear,  however,  to  be  of  major 
Importance  only  rarely.  Evidence  of  the  existence  of  these 
factors  should  always  be  sought  since  occasionally  it  may  be 
necessary  to  account  for  them  in  at  least  a seml-quantltative 
fashion . 

Seasonal  Influences  may  modify  the  neighborhood  reaction 
to  noise.  In  the  summer  many  residents  leave  their  windows 
open  a large  part  of  the  time  and  their  exposure  to  a noise 
originating  outdoors  is  likely,  therefore  to  be  more  severe . 

On  the  other  hand  the  meteorological  and  terrain  conditions 
are,  in  many  areas,  more  favorable  for  the  propagation  of  sound 
in  winter.  Consequently  the  seasonal  influences  depend  upon 
such  factors  as  climate  and  distance  from  the  noise  sources  to 
the  residential  areas . Thus  it  is  difficult  to  state  a general 
rule  that  is  applicable  in  all  cases.  Terrain  and  meteorological 
conditions  and  their  Influence  on  sound  propagation  have  been 
discussed  in  Chapters  9 and  12,  respectively. 

8.  Summary . Table  l8.3  summarizes  the  various  physical 
characteristics  of  the  stimulus  and  indicates  the  quantitative 
Influence  of  each  in  the  form  of  correction  numbers  that  must 
be  applied  to  the  level  rank  to  obtain  the  noise  rating. 

The  level  rank  and  the  noise  rating  are  identical  for 
the  case  in  which  (l)  the  noise  has  a continuous  spectrum, 

(2)  it  has  a uniform  short-time  character,  (3)  it  is  continuous 
over  long  time  Intervals,  (4)  it  is  present  at  nighttime,  (5) 
the  acoustic  environment  is  similar  to  that  of  a suburban 
community,  and  (6)  there  has  been  little  previous  exposure  to 
the  noise . 

Relation  between  Response  and  Noise  Rating . Having 
established  the  noise  rating  from  physical  measurements  of  the 
stimulus,  we  must  next  determine  the  expected  response  for 
each  noise  rating.  The  Inherent  assumption  is  that  in  the 
absence  of  dramatic  events  or  particular  psychological  circum- 
stances, stimuli  that  have  the  same  noise  ratings  (although 
perhaps  quite  different  level  ranks)  all  produce  the  same 
response  within  a range  of  statistical  variations . It  is  a 
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TABLE  18.3 

LIST  OF  CORRECTION  NUMBERS  TO  BE  APPLIED  TO  LEVEL  RANK  TO 

GIVE  NOISE  RATING 


InfluenclriK  Factor 

Possible  Conditions  Correction  No 

Spectrum  character 

Continuous 

0 

Pure -tone  components 

+1 

Peak  Factor 

Continuous 

0 

Impulsive 

+1 

Repetitive 
Character  (20-30 

> 1 exposure  per  min 
(or  continuous) 

0 

Sec  Exposures 
Assumed) 

10-60  exposures  per  hr 

-1 

1-10  " " " 

-2 

4-20  ” " day 

-3 

1-4  '•  " " 

-4 

< 1 exposure  per  day 

-5 

Background  Noise 

See  Figure  18.2 

+1  to  -3 

Time  of  Day- 

Daytime  only 

-1 

Nighttime 

0 

Adjustment  to 
Exposure 

No  previous  exposure 

0 

Considerable  previous 
exposure 

-1 

Extreme  conditions  of 
exposure 

-2 
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truism  that  all  residents  do  not  react  alike  to  a given 
stimulus.  There  is  a distribution  of  responses,  and  it  is 
necessary,  therefore,  to  express  the  response  in  terms  such 
as  expected  "average  response,"  and  "range  of  expected  re- 
sponses from  a normal  population." 

The  graph  in  Pig.  l8.3  shows  a relation  that  has  been 
derived  from  empirical  data.  The  response,  stated  in  descrip- 
tive terms,  is  plotted  as  the  ordinate,  and  the  properties  of 
the  stimulus  are  given  by  the  noise  rating,  which  forms  the 
abscissa. 

The  response  scale  is  again  a rank  order  scale.  At  the 
low  end  is  the  "no  annoyance"  region,  which  is  defined  as  the 
region  in  which  practically  no  residents  are  annoyed  by  the 
noise.  This  amounts  to  saying  that  residents  do  not  consider 
the  noise  to  be  "intruding."  Next  is  the  region  described  by 
"mild  annoyance."  In  this  range  residents  are  not  likely  to 
complain,  say, to  the  management  of  the  noisy  factory,  but 
they  may  comment  on  their  annoyance  among  themselves.  "Mild 
complaints"  may  be  in  the  form  of  an  occasional  telephone  call 
to  the  management.  The  next  step  on  the  scale  of  response  is 
"strong  complaints,  which  could  be  in  the  form  of  complaints 
to  the  management,  to  a newspaper,  or  to  the  police..  "Threats 
of  legal  action"  may  result  if  stimulus  conditions  are  still 
more  severe.  The  final  category  is  "vigorous  legal  action." 
These  last  two  terms  are  self-explanatory. 

Substantiation  by  Case  Histories  and  Other  Experiences . 
The  above  scheme  for  the  evaluation  of  neighborhood  reaction 
to  noise  can  be  justified  only  by  checking  it  against  empirical 
data.  • The  empirical  data  are  in  the  form  of  case  histories. 

We  have,  of  course,  used  some  of  our  case  histories  in  setting 
up  the  scheme,  and  it  is  hardly  conclusive  now  to  use  the  same 
case  histories  to  show  the  validity  of  the  scheme.  The  real 
test  for  the  scheme  will  come  in  the  future  when  its  ability 
to  predict  behavior  of  communities  will  be  tested  in  new 


Figure  18.3 

Relation  between  the  composite  noise  rating  and 
the  response.  The  expected  average  response  is 
shown,  together  with  the  expected  range  of  varia- 
tion. 
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situations.  In  the  meantime,  the  available  case  histories 
point  up  the  spectrum  of  experience  upon  which  this  scheme 
was  based,  and  hence  also  the  type  of  situation  it  ought  to 
be  able  to  handle . We  have  summarized  a number  of  case 
histories  in  which  data  from  physical  measurements  of  the  stim- 
ulus are  available,  and  in  which  some  subjective  data  on  neigh- 
borhood reactions  are  known. 

We  recognize,  of  course,  that  the  available  case 
histories  do  not  cover  a sufficiently  wide  range  of  conditions 
to  substantiate  the  proposed  scheme  in  all  details.  The  selec- 
tion of  some  of  the  correction  numbers  (such  as  those  for 
repetitive  character)  is  not  based  entirely  on  evidence  from 
case  histories,  but  may  be  based  more  on  common  experience  and 
even  "intuition."  As  further  data  are  gathered,  some  changes 
in  the  scheme  will  be  found  necessary.  Stimulus  characteristics 
other  than  those  considered  here  may  prove  to  be  Important  in 
determining  the  response. 

Another  limitation  of  the  case  histories  is  the 
uncertainty  in  the  definition  of  the  response  in  many  cases. 

It  is  clearly  impossible  to  obtain  an  adequate  sampling  of 
neighborhood  reaction  unless  a well -controlled  opinion  survey 
is  carried  out.  In  most  of  the  case  histories  described  here, 
the  specification  of  the  response  is  based  on  verbal  and 
documentary  evidence  obtained  from  the  management  of  the  facility 
that  is  responsible  for  the  noise. 

The  evidence  in  the  form  of  case  histories  is  summarized 
in  Table  l8.4.  All  pertinent  data  concerning  both  stimulus 
and  response  are  shown  in  the  various  columns  of  the  table . 

For  each  case  history  the  first  column  gives  a brief  descrip- 
tion of  the  facility  and  the  type  of  noise  it  produces.  The 
level  rank  for  the  noise  measured  in  the  residential  areas  is 
indicated  in  Column  2.  The  spectra  for  some  of  the  cases  are 
shown  in  Pig.  l8.4.  The  various  correction  numbers  are  indi- 
cated in  Columns  3-8,  and  the  composite  noise  rating  is 


Figure  l8.4 

Measured  spectra  of  intruding  noise  for  four  of 
the  case  histories  listed  in  Table  l8.4.  The 
dashed  lines  indicate  the  boundaries  of  the  level 
ranks,  from  Fig.  18.1.  In  each  case,  the  level 
rank  is  the  highest  rank  into  which  the  noise 
spectrum  protrudes. 
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SUMMARY  OP  CASE  HISTORIES  OF  RESPONSES  TO  NOISE  IN  RESIDENTIAL  AREAS 
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vldual  telephone 
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complaints . 


• 

0 

T5 

H 

0 

0 

43 

CQ 

\ 

1 

43 

d 

CQ 

d 

0 

d 

d 

0 

d 

0 

43 

0) 

43 

43 

Ch 

0 

0 

43 

0 

• 

0 

d 

0 

0 

43 

43 

a)  th 

c 

iH 

05 

d 

05 

1 

g 

43 

d 

0 w 

g 

43 

•H 

d 

43 

d 

d 

■d  P> 

•H 

(D 

0 

u 

0 

•H 

u 

•H 

< 43 

d 

Cd 

0 

0 

0 

•H 

•H 

0 

05 

43 

Ci 

> 

0 

u 

43 

0 

o5 

cd 

d 

•H 

0 

iH 

0 

d 

0 

cd 

0 

d 

pH 

0 

U 

rH 

• cd 

U 

d 

& 

•H 

0 

rH 

0 

0 

ft  TJ 

0 

d 

•H 

m 

o5 

d 

a-p  4-> 

£ 

6 

& 

d 

ft 

g 

c 

43 

ft 

05 

43 

T!5 

05 

0 

g 

C rH 

Pi 

43 

0 

P>»'0 

0 

g 

0 

0 

o5 

•H 

0 

(D  3 

0 

0 

iH 

0 

•H 

0 

Pi 

0 

0 

0 

•H 

d 

43 

0 

6 m 

d 

43 

iH 

0 

0 

d 

d 

43 

0 

05 

0 

d 

cd 

0)  C 

d 

0 

0 

d 

iH 

d 

0 

d 

0 

(Q 

(D 

0 

iH 

0 

d 

CQ 

bO  0 

•H 

•H 

43 

0 

Cd 

0 43 

d 

43 

T5 

0 

ft 

•H 

0 

• 

d 

o3  0 

43 

d 

43 

0 

d 

d 

0 

d 

0 

43 

• 

0 

to 

g 

•O 

43 

ft 

s 

0 

c: 

'd 

0 

0 

(Q 

0 

0 

C 

d 

(1> 

0 

43 

05 

0 

0 

cd 

0 

d 

cd  tH 

0 

d 

•H 

0 

0 

d 

0 

rH 

c 

ft 

0 

0 

43 

d 

0. 

0 

E cd 

rH 

d 

•H 

43 

0 

g 

to 

0 

B 

CQ 

d 

0 

d 

to 

0 

rH 

CQ 

d 

d 

0 

to 

0 

d 

iH 

0 

o5 

0 

0 

ft 

0 

rH 

•H 

0 *H 

Cd 

d 

0 

0 

Cd 

■H 

•H 

d 

0 

> -Q 

a-p 

ft 

ft 

0 

<H 

1 — 1 

> 

-P  -P 

0 

H 

pc; 

0 

43 

> 

> 

CQ 

c 

•H 

to  rd 

C5  rH 

O ft 
U g 
4:>  O 
CO  o 


0) 
o 
g 
05 

O 
C 

S 05 


CQ 

-P 


d 

o 


d 

0 

dn  "H 

<D  -P  CQ  d 
d 0 3 0 
£ Cd  0 *H 

0 

d 

0 

•H 

0 43 

■P  -P 

43 

0 

43 

0 

iH  0 0 

d 

0 

0 cd 

d cd  bO  Cd 

•H 

0 

cd 

43 

d)  bO  "rH 

to  cd 

43 

cd  iH 

(U  0)  > H 

d rH 

cd 

rH 

0 cd 

? 1 — 1 cd 

0 ft 

0 

Cd 

d hO 

43  T3  bD 

d g 

h 

to 

B 0 

0)  (»H  d <1> 

43  0 

0 

pq  0 cd  rH 

CO  0 

EH 

pH 

0 

Q 

pC4 

iH 

1 

rH 

1 

0 

1 — 1 
1 

i — 1 

1 

0 

0 

rH 

1 

0 

0 

0 

rH 

1 

0 

rH 

1 

0 

+1 

0 

iH 

1 

* 

♦ 

CVJ 

1 

0 

iH 

1 

0 

0 

in 

1 

0 

0 

0 

0 

0 

iH 

+ 

0 

0 

0 

rH 

rH 

0 

+ 

+ 

C5 

PC4 

d ^ 

•H  05 
0) 

43  d 

43 

05  43  d 
d 4:1  O 
o to  ft 

d -H  d 
•H  rH  "H 
^ 05 


VO 

WADC  TR 


cd 


43 

m 

0 

fH 

d 

1 0 

Pi  d 

Pi 

0 0 o5 

1 0 

•P  -P 

d cd 

43 

^ j. 

t043  0 d cd 

d rH 

Cd  d 0) 

d 0 

0 

cd  o' 

d d 0 0 

0 ft  0 

rH  0) 

0 > • 

0.  CD  CQ 

cd  0 1 > 

rH 

s •> 

g ro 

d 

E r-i  -p 

d 43  cn  0 d 

T5 

43  tOtH 

■P  bO 

d d 0 

cd 

0 hh  d 

43  0 0 

0 

<H  Ah  0 

bO-H  •>  d 

0 -H  d 

Ah 

0 (D 

0 -H  ft 

43 

cd  g 0 

d ? CQ  rH 

Ah 

CD  d 

d g to  0 

d 

■H  ^ P> 

0 0 43 

0 

d rH  0 

0 d d 43  0 

g 

0 0 43 

xJ  E 0 3 

d 0 0 

bO  (U  bO  Q< 

ft  0 «H  0 0 

•rH 

d d CQ 

cd  Sh  3 0 

cd  *H  *H 

d 

? d g 

cd  43  d d g 

43 

«H  *H  0 

0 0 d 

d 0 d 

cd 

0 -H  0 

0 C -H  d t4 

0 

< t043 

id  Ci_,  -P  CQ 

EH  d cr 

ft  0 0 

-H  <H  0 43 

0 

CO 

0 

rH 

♦ 

iH 

rH 

2-204 

199 

evaluated  In  Column  9*  The  average  predicted  response, 
from  Fig.  l8.3  Is  noted  In  Column  10.  Column  11  gives  a 
brief  description  of  the  neighborhood  reaction  that  was 
actually  encountered.  Comparison  of  Columns  10  and  11  Indi- 
cate fairly  good  agreement  between  the  predicted  and  the 
actual  response  In  all  cases. 

Let  us  examine  Case  History  4 as  an  example.  At  a 
large  aircraft  manufacturing  plant,  with  attached  airport, 
maintenance  checks  are  run  Intermittently  on  two-engine  re- 
ciprocating aircraft  during  the  daytime  hours  before  10:00  p.m 
The  duration  of  a run-up  may  be  as  high  as  15  minutes  and  a 
correction  number  for  repetitive  character  Is  set,  therefore, 
at  -1  (one  rank  lower  than  continuous  noise).  In  the  nearby 
residential  areas  no  complaints  are  submitted  to  the  manage- 
ment of  the  company.  Presumably  the  residents  have  become 
somewhat  adjusted  to  the  noise,  and  many  of  them  are  employed 
by  the  company.  The  average  spectrum  of  the  noise  In  the 
residential  area  Is  shown  In  Fig.  l8.4.  Computation  of  the 
noise  rating  Is  shown  In  Table  18.4.  It  Is  of  Interest  to 
note  In  this  example  that,  according  to  Fig.  l8.3^  mild  com- 
plaints to  the  management  would  be  expected  If  operations 
continued  through  the  night. 

l8 . 3 Criteria  for  Speech  Communication 

In  many  working  spaces  In  factories  and  In  offices, 
efficient  performance  of  tasks  Is  often  dependent  upon  the 
ability  of  people  to  talk  to  each  other.  Whether  the  environ- 
ment Is  a conference  room  or  a machine  shop,  noise  conditions 
should  be  adjusted  to  permit  communication  suited  to  the  task 
that  Is  to  be  performed.  The  type  of  communication  desired 
may  be  of  various  kinds,  from  conversation  In  a normal  voice 
at,  say,  20  ft  to  shouting  danger  signals  at  a distance  of 
6 In  from  the  listener's  ear.  The  acceptable  levels  of  the 
masking  noise  are  dependent,  therefore,  upon  the  particular 
tasks  Involved  and  upon  the  degree  to  which  speech  communica- 
tion Is  Important  In  the  performance  of  these  tasks,  or  In  the 
maintenance  of  adequate  morale  among  employees  (see  Sec.  17.2) 

In  this  section,  criteria  are  suggested  for  noise  levels 
that  should  be  acceptable  In  spaces  where  speech  communication 
Is  an  Important  consideration.  Account  Is  taken  of  not  only 
the  level  of  the  masking  noise  but  also  the  vocabulary  to  be 
used  In  the  communication,  the  voice  level,  the  distance  from 
the  speaker  to  the  listener, etc . 
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Two  general  types  of  jDaslc  material  are  available  for 
substantiation  of  the  proposed  criteria  for  speech  communica- 
tion: (l)  data  have  been  presented  in  Sec.  l6.4  on  the  phy- 

sical properties  of  speech  and  on  laboratory  experiments 
concerned  with  the  effects  of  masking  noise  and  other  distor- 
tions upon  the  intelligibility  of  speech.  In  the  laboratory 
experiments,  intelligibility  is  usually  expressed  in  terms 
of  the  percentage  of  words  or  other  test  items  recorded  cor- 
rectly during  a test,  i.e.,  the  articulation  score.  (2)  field 
data  of  a subjective  nature  have  been  collected  in  places  where 
people  are  exposed  to  noise  and  where  they  are  required  to 
talk  to  each  other.  Measurements  of  noise  levels  were  made, 
and  people  were  asked  how  well  they  could  understand  each  other. 

In  the  final  evaluation  it  is  necessary  to  Interpret 
the  above  two  sets  of  findings  with  care,  since  important  dif- 
ferences exist  in  the  way  they  were  obtained.  Most  of  the 
Intelligibility  tests  in  the  laboratory  were  conducted  with 
earphones.  In  studies  of  the  masking  of  speech  by  noise,  the 
masking  noise  was  usually  mixed  electrically  with  the  speech 
signal.  It  has  been  demonstrated,  however,  that  higher 
articulation  scores  are  obtained  when  the  phases  of  the  noise 
or  of  the  speech  in  the  two  ears  are  different  than  if  both 
signals  are  in  phase  in  the  two  ears  ^ (sec.  l6.4).  Hirsh  ^ 
has,  furthermore,  shown  that  there  is  an  improvement  in 
intelligibility  when  the  noise  and  the  speech  sources  are  lo- 
cated at  different  points  in  a room,  provided  the  subject  can 
move  his  head  freely.  When  listening  to  a person  speaking  in 
a noisy  room,  one  is  usually  at  liberty  to  turn  one’s  head  to 
face  the  talker  so  that  the  speech  arrives  at  the  two  ears  in 
phase . The  source  of  the  noise  is  usually  different  from 
that  of  the  speech,  however,  and  consequently  the  noise  signal 
at  the  two  ears  is  not  in  phase . An  Improvement  in  intelligi- 
bility should  therefore  be  expected  over  that  predicted  from 
laboratory  experiments  for  the  same  speech-to-nolse  ratio,  in 
which,  however,  boch  speech  and  noise  are  in  phase  at  the  two 
ears . 


An  incidental  observation  that  is  often  Important  in  field 
studies  is  that  the  listener  is  usually  able  to  observe  the  lips 
and  other  gestures  of  the  speaker.  Thus  he  probably  is 
assisted  by  visual  cues  as  well  as  auditory  ones  when  he  at- 
tempts to  understand  what  is  being  said.  No  quantitative  data 
are  available  at  present  on  the  relative  Importance  of  such 
visual  cues,  but  it  is  reasonable  to  suppose  that  they  do  make 
a contribution.  This  is  a problem  that  might  be  of  Importance 
in  military  situations  when  marginal  communication  is  possible 
during  daylight  hours. 
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Let  us  begin  by  reviewing  briefly  the  pertinent  data 
from  Sec.  l6.4  that  relates  speech  intelligibility  to  the  level 
of  the  masking  noise . We  note  that  the  Important  frequency 
range  for  speech  Is  200  to  6000  cps,  and  that  frequencies  above 
and  below  this  range  contribute  little  to  Intelligibility.  In 
fact.  Pig.  16.38  of  Sec.  l6.4  shows  that  80  percent  of  this 
Important  frequency  range*  is  covered  by  the  octave  bands  of 
frequency  600-1200,  1200-2400  and  2400-4800  cps . That  Is,  the 
frequency  components  of  speech  within  these  bands  contribute 
"80  percent  to  the  articulation  Index. 

It  has  been  shown  that  a single  number  which,  within 
certain  limits  to  be  discussed  below,  describes  the  masking 
effectiveness  of  a noise  Is  the  speech  Interference  level 
(SIL)  6,7,8/ . The  SIL  is  defined  as  the  average.  In  decibels, 
of  the  sound  pressure  levels  of  the  masking  noise  In  the  three 
octave  bands  of  frequency  600-1200,  1200-2400  and  2400-4800 
cps*t  The  speech  Interference  level  Is  Intended  to  provide, 
therefore,  a measure  of  the  extent  to  which  noise  Interferes 
with  the  ability  of  people  to  converse. 

If  the  SIL  Is  to  be  used  to  compare  the  ability  of 
people  to  communicate  by  voice  in  different  types  of  noise  in 
different  situations,  certain  precautions  must  be  observed. 
Ideally,  the  voice  level  of  the  talkers  should  be  fixed,  the 
same  type  of  vocabulary  should  be  used,  and  the  relative 


As  discussed  In  Sec.  l6.4,  the  frequency  scale 
is  distorted  In  such  a fashion  that  equal  dis- 
tances along  the  scale  represent  equal  contri- 
butions to  articulation  index. 

In  a discussion  of  acoustic  criteria  on  ship- 
board spaces,  Strasberg  9/  has  defined  the 
speech  Interference  level  as  the  average,  in 
decibels,  of  the  sound  pressure  levels  In  the 
four  octave  bands  of  frequency  3OO-6OO,  6OO- 
1200,  1200-2400,  and  2400-4800  cps.  In  most 
applications,  the  difference  between  the  values 
of  SIL  based  on  the  two  different  definitions 
Is  not  great . 
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positions  of  talker  and  listener  should  be  about  the  same. 

In  addition,  the  spectrum  of  the  masking  noise  should  be 
reasonably  smooth,  and  should  not  deviate  markedly  from  the 
speech  spectrum  shown  in  Fig.  16.27. 

The  SIL  becomes  a rather  Inaccurate  measure  of  the  mask- 
ing of  speech  by  noise  if  the  noise  contains  intense  low- 
frequency  components . Miller  10/  has  shown*  that  If  a low- 
frequency  masking  noise  (belowbOO  cps)  is  sufficiently  Intense 
It  can  mask  speech  completely.  For  this  case  the  SIL  could 
be  quite  small;  hence  It  would  not  constitute  a valid  measure 
of  the  amount  of  masking , 

If  the  noise  spectrum  has  a steep  slope  and  is  rather 
irregular  in  the  range  600-4800  cps,  the  process  of  estimating 
the  average  noise  level  in  each  octave  band  and  averaging  the 
levels  in  decibels  is  likely  to  Introduce  some  error.  In  the 
limiting  case  of  the  masking  of  speech  by  pure  tones,  square 
waves  and  pulses  (Fig.  16.33)  the  use  of  the  SIL  to  predict 
the  amount  of  masking  would  clearly  be  subject  to  considerable 
error.  For  example,  Stevens,  Miller  and  Truscott  11/  show 
that  a pure  tone  of  frequency  500  cps  is  the  most  effective 
sine  wave  masker  of  speech.  It  would  be  meaningless  to  talk 
of  an  SIL  for  such  a masking  signal . 

A final  observation  concerns  the  time  character  of  the 
masking  noise.  The  concept  of  the  SIL  was  originally  based 
on  data  derived  from  experiments  with  continuous  masking  noise . 
If  the  noise  is  Irregular  or  Interrupted  in  time,  the  Intelligi- 
bility is  affected  in  a way  which  the  SIL  does  not  predict 
(see  Fig.  16.34).  If  the  SIL  for  this  type  of  noise  were  mea- 
sured- with  conventional • Instruments,  the  result  would  obviously 
depend  upon  the  dynamic  characteristics  of  the  meter. 

At  present,  we  do  not  have  sufficient  experimental  data 
to  delineate  the  limits  of  intensity,  shape  of  spectrum  and 
time  character  of  the  noise  within  which  the  application  of  the 
SIL  is  valid.  It  is  fortunate,  however,  that  the  masking  noises 
encountered  in  most  practical  situations  have  a reasonably 
smooth  spectrum  and  a uniform  time  character.  In  such  situa- 
tions the  SIL  does  provide  a reasonably  good  approximation  of 
the  effectiveness  of  a noise  in  masking  speech.  The  validity 
of  the  use  of  the  SIL  is  also  Justified  by  many  observations 
in  the  field,  some  of  which  are  cited  below. 


Figs.  16.31  and  16.32  of  Sec.  l6.4. 
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Under  the  limitations  discussed  above  we  conclude  that 
ability  to  communicate  by  voice  In  the  presence  of  noise  Is 
determined  essentially  by  four  factors:  (l)  the  SIL  of  the 

masking  noise,  (2)  the  voice  level  used  by  the  talker,  (3)  the 
distance  from  the  talker's  mouth  to  the  listener's  ear,  and 
(4)  the  nature  of  the  vocabulary  used  In  communication. 

For  example,  a talker  is  better  understood  In  the  pre- 
sence" of  noise  If  he  raises  his  voice  and  moves  closer  to  the 
listener.  Also,  we  have  shown  In  Sec.  l6.4  that  Intelligibility 
Is  strongly  dependent  upon  the  nature  of  the  vocabulary;  the 
score  Is  higher  for  digits  than  for  unselected  words,  for 
example.  Thus  if  the  only  words  used  In  conversation  were.  In 
a linguistic  sense,  maximally  distinct  12/,  a much  higher  noise 
level  could  be  tolerated  than  If  continuous  conversation  were 
necessary. 

Beranek  6,7/  has  Investigated  the  degree  to  which  SIL, 
voice  level  and  distance  between  talker  and  listener  control 
the  ability  to  carry  on  a reliable  conversation.  Table  l8.5 
summarizes  his  findings.  If  a prearranged  restricted  vo- 
cabulary were  to  be  used,  the  distances  between  talker  and 
listener  could  be  Increased.  The  amount  of  Increase  would  de- 
pend upon  the  size  of  the  vocabulary  13/,  other  things  being 
equal.  (See  Sec.  l6.4).. 

It  is  assumed  that  the  talker  has  an  average  voice  and 
the  listener  normal  hearing.  A normal  voice  level  is  the  level 
one  would  use  in  a quiet  office  talking  with  a friend.  A very 
loud  voice  level  is  the  highest  voice  level  one  can  maintain 
over  a period  of  time  without  becoming  hoarse . There  are  often 
pronounced  differences  in  the  voice  level  used  by  different 
talkers.  A "normal"  voice  level  may  vary  by  as  much  as  - 8 
decibels,  depending  upon  the  talker.  Thus  the  numbers  in 
Table  l8.5  must  be  Interpreted  as  statistical  averages  only, 
and  should  be  applied  with  caution  in  situations  involving 
only  one  or  two  individuals. 

Empirical  evidence  indicates  that  there  is  a difference 
of  about  6 db  between  the  categories  of  voice  level  shown  in 
Table  l8.5  for  the  average  talker.  Thus  the  SIL's  change  in 
6 db  steps  as  we  move  across  the  table.  The  SIL's  decrease 
with  increasing  distance,  owing  to  the  "inverse  square"  decrease 
in  speech  level  with  distance.  It  should  be  noted,  however, 
that  the  sound  pressure  level  at  some  distance  from  a source 
in  a room  may  be  greater  than  that  predicted  from  the  Inverse 
square  law,  owing  to  reverberation  in  the  room  (Chapter  3) . 
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TABLE  18.5  6,7/ 

SPEECH  INTERFERENCE  LEVELS  (IN  DECIBELS  RE  0.0002  DYNE/CM^) 
WHICH  BARELY  PERMIT  RELIABLE  CONVERSATION  AT  THE  DISTANCES 
AND  VOICE  LEVELS  INDICATED 


Voice  Level 


Distance  (ft) 

Normal 

Raised 

Very  Loud 

Shouting 

0.5 

71 

77 

83 

89 

1 

65 

71 

77 

83 

2 

59 

65 

71 

77 

3 

55 

6l 

67 

73 

4 

53 

59 

65 

71 

5 

51 

57 

63 

69 

6 

49 

55 

6l 

67 

12 

43 

49 

55 

6l 

This  fact  mav  have  to  be  taken  into  account  when  Table  l8.5  is 
used.  The  Table  applies  to  the  case  in  which  there  are  no  re.- 
flecting  surfaces  nearby,  and  the  listener  and  talker  are  fac- 
ing each  other. 

Laboratory  data  on  the  intelligibility  of  speech  masked 
by  noise  are  in  general  agreement  with  the  data  of  Table  l8.5. 
For  example.  Miller  10/  has  investigated  the  masking  of  speech 
by  white  noise,  by  bands  of  noise  and  by  speech  itself.  If 
we  comput  the  SIL's  of  his  masking  signals  we  find  that  in  most 
cases  the  SIL’s  which  oarely  permit  reliable  conversation  (i.e. 
a sentence  intelligibility  close  to  100  percent)  agree  with  the 
levels  of  Table  18.5  within  a few  decibels.  We  must  make 
allowance  for  the  fact  that  Miller's  subjects  wore  earphones, 
and  that  subjects  who  listen  without  earphones  and  are  free  to 
turn  their  heads  would  obtain  somewhat  higher  articulation 
scores.  In  addition,  we  find  general  agreement  between  Beranek 
6, 14/  graphical  scheme  for  computing  the  articulation  index  and 
the  data  of  Table  l8.5>  provided  the  spectrum  of  the  masking 
noise  does  not  differ  markedly  from  the  speech  spectrum. 

WADC  TR  52-204  205 


The  original  field  studies  6/  that  attempted  to  validate 
the  SIL  concept  were  carried  out  in  the  presence  of  "simulated 
airplane  noise"  (from  propeller  driven  aircraft).  In  a later 
survey  of  noise  in  office  spaces,  Beranek  and  Newman  7^8/ 
were  able  from  their  measurements  and  questionnaires  to  vali- 
date in  a general  way  some  of  the  data  in  Table  18.5.  Few 
other  systematic  studies  have  been  conducted.  However, 
numerous  observations  have  been  accumulated,  and  the  "expert" 
can  usually  tell  if  the  SIL  concept  will  predict  correctly  or 
not.  A comprehensive  investigation  relating  ability  to  con- 
verse to  octave  band  measurements  of  the  Interfering  noise  in 
various  environments  would  prove  most  valuable . 

The  field  studies  have  also  provided  Information  on  the 
nature  of  the  possible  communication  in  various  noise  environ- 
ments. For  a SIL  of  45  db,  relaxed  conversation  was  possible; 
for  a SIL  of  55  db,  continuous  communication  was  possible,  but 
usually  in  a somewhat  raised  voice . Intermittent  communication 
would  be  carried  on  in  a noise  with  a SIL  of  65  db,  whereas 
only  a minimal  type  of  communication  was  possible  at  a SIL  of 
75  db.  A restricted  prearranged  vocabulary  might  be  desirable 
at  a SIL  of  75  db. 

The  above  comments  on  the  SIL,  distance,  and  voice  level 
necessary  for  reliable  conversation  amount  to  the  assumption 
that  the  frequency  components  of  a noise  below  600  cps  are 
not  effective  in  masking  the  speech.  The  type  of  noise  spec- 
trum normally  encountered  in  secretarial  offices,  tabulating 
rooms  and  private  offices  is  well-behaved  from  the  viewpoint 
of  the  SIL  concept . 

Laboratory  data  summarized  in  Figs.  16.31  and  16.32  of 
Sec.  l6.4  have  shown  that  low  frequency  noise  of  high  level 
can  be  a very  effective  masker  of  speech.  Under  these  circum- 
stances, the  masking  effect  of  the  noise  is  not  limited  to  the 
frequency  band  in  which  there  is  noise  energy,  but  extends 
into  the  higher  frequency  range . Thus  for  a masking  noise 
which  has  Intense  low  frequency  components,  the  SIL  is  not  an 
adequate  measure  of  the  masking  effectiveness.  If  a given 
quality  of  speech  communication  is  desired  (the  quality  being 
defined  in  terms  of  voice  level  and  distance  required  for  reli- 
able conversation,  as  in  Table  l8.5)>  we  must  specify  the 
maximum  acceptable  levels  of  the  masking  noise  in  each  octave 
band  of  frequency  below  600  cps,  as  well  as  the  SIL. 
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Such  a specification  is  presented  in  Fig.  l8.5  in  the 
form  of  speech  communication  (SC)  criterion  curves.  The 
extrapolation  of  these  curves  to  low  frequencies  is  based  on 
empirical  evidence . Further  field  data  on  masking  of  speech 
by  low  frequency  noise  are  required  to  provide  further  valida- 
tion of  the  low  frequency  portions  of  the  curves.  The  SC-55 
curve,  for  example,  specifies  for  each  octave  band  of  the 
masking  noise  the  sound  pressure  level  that  must  not  be 
exceeded  if  speech  communication  conditions  comparable  to  those 
for  a SIL  of  55  db  (from  Table  l8.5  are  to  be  realized. 

The  speech  communication  criteria  are  summarized  in 
Table  l8.6.  The  number  designating  the  SIL  is  shown  in  the 
left  column.  Other  columns  indicate  the  voice  level,  the  dis- 
tance from  talker  to  listener,  and  the  nature  of  the  possible 
communication.  The  final  column  suggests  the  type  of  working 
spaces  for  which  the  various  criteria  are  recommended.  This 
table  summarizes  (l)  data  recorded  by  means  of  questionnaires 
administered  during  a survey  of  noise  levels,  (2)  evaluative 
opinions  with  regard  to  noise  in  working  spaces,  and  (3)  ex- 
perience of  acoustical  consultants  at  many  different  locations. 
A more  detailed  discussion  of  criteria  for  noise  control  in 
certain  specific  types  of  working  areas  is  presented  in  Sec. 
18.5. 


Figure  l8.5  shows  curves  SC-25  and  SC-35*  although  a 
description  of  the  communication  conditions  and  environments 
for  which  these  criteria  are  applicable  is  not  contained  in 
Table  18.6.  The  low  levels  defined  by  these  curves  are  not 
usually  necessary  in  relatively  confined  working  spaces.  How- 
ever., if  reasonably  good  hearing  conditions  for  speech  and 
music  are  to  be  provided  in  audltorla,  the  background  noise 
levels  must  be  quite  low.  For  example,  a SIL  of  about  25  db 
would  be  desirable  in  large  auditoriums  where  excellent  hear- 
ing conditions  must  be  provided,  and  where  a high  degree  of 
speech  intelligibility  is  desired  15^16/  (see  Sec.  18.5). 

l8.4  Criteria  for  Prevention  of  Hearing  Loss 

In  Sec.  17.4  we  have  seen  that  prolonged  exposure 
to  noises  having  certain  sound  pressure  levels  and  frequency 
characteristics  can  produce  permanent  shifts  in  man's  auditory 
threshold.  In  surroundings  in  which  noise  may  produce  per- 
manent hearing  loss,  it  is  Important  to  reduce  or  control  the 
noise  that  reaches  the  ears  of  persons  who  must  remain  in  the 
noisy  environment.  In  these  situations,  criteria  for  noise 
control  are  required.  Such  criteria  should  specify  the 
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characteristics  of  noises  that  are  unlikely  to  produce 
permanent  hearing  loss.  A criterion  of  this  type  has  been 
termed  a Damage  Risk  (DR)  Criterion.  It  Is  designed  to  reduce 
the  risk  that  an  Individual  will  suffer  permanent  damage  to 
his  hearing.  The  concept  of  risk  Is  a statistical  concept; 
hence  the  successful  validation  of  a DR  criterion  Involves  the 
demonstration  that  the  Incidence  of  hearing  loss  In  the  noise - 
expx)sed  population  does  not  differ  significantly  from  the 
Incidence  In  normal  populations  for  the  same  age  group. 

The  Specification  of  a DR  Criterion.  Before  we  begin  to 
establish  or  even  to  propose  a DR  criterion,  we  must  decide 
how  such  a criterion  Is  to  be  specified.  What  are  the  Important 
characteristics  of  the  acoustic  stimulus  that  are  effective  In 
producing  pennanent  hearing  loss?  How  Is  man's  reaction  to  the 
exposure,  l.e.,  the  damage  to  his  hearing,  to  be  measured  and 
specified?  Some  of  these  matters  have  been  discussed  In  some 
detail  In  Sec.  17 so  that  we  need  to  present  here  6nly  an 
outline  of  the  considerations  that  are  pertinent  to  the  spe- 
cification of  a DR  criterion. 

Let  US  first  examine  the  aspects  of  the  exposure  stimulus 
that  are  likely  to  be  Important  In  producing  permanent  hearing 
loss.  It  Is  clear  that  the  sound  pressure  level  of  the  noise 
to  which  an  Individual  Is  exposed  Is  an  Important  determinant 
of  the  damage  that  may  be  expected.  Other  things  being  equal, 
the  more  Intense  the  noise  the  greater  Is  the  probability  for 
the  occurrence  of  permanent  hearing  loss . We  could  even  think 
of  defining  a population  threshold  for  permanent  damage  In 
terms  of  a convenient  probability  value,  of,  say,  50  percent. 


Figure  18.5 

Speech  communication  (SC)  criteria.  The 
curves  are  labeled  with  numbers  equal  to  the 
speech  Interference  levels  they  represent. 

Each  curve  specifies  the  octave-band  pressure 
levels  that  must  not  be  exceeded  If  a certain 
quality  of  speech  communication  Is  to  be  guar- 
anteed. In  each  case  the  quality  of  speech 
communication  Is  given  In  Tables  I8.5  and  I8.6 
In  terms  of  the  SIL,  voice  level,  distance 
from  talker  to  listener,  and  type  of  vo- 
cabulary to  be  used. 
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Frequency  band  cps 
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TABLE  18.6* 


SPEECH  COMMUNICATION  CRITERIA 

Relation  between  SC  Criteria  Expressed  by  Speech  Interference 
Levels  and  The  Communication  Conditions  for  a Degree  of  In- 
telligibility that  is  Marginal  with  Conventional  Vocabulary 
and  Good  with  Selected  Vocabulary. 


Voice  Level 

SIL  In  and  Nature  of  Possible  Type  of  Working 

Decibels  Distance  Communication  Area 


45 

55 


65 


75 


Normal  Voice 
at  10  ft 


Relaxed  Conversa- 
tion 


Private  Offices, 
Conference  Rooms 


Normal  Voice 
at  3 ft 


Continuous  Com- 
munication In  Work 
Areas 


Business,  Se- 
cretarial, Con- 
trol Rooms  of 
Test  Cells,  etc. 


Raised  Voice 
at  6 ft 


Very  Loud  Voice 
at  12  ft 

Raised  Voice  Intermittent 
at  2 ft  Communication 


Very  Loud  Voice 
at  4 ft 


Shouting  at 
8 ft 

Very  Loud  Minimal  Communication 

Voice  at  1 ft  (danger  signals;  re- 
stricted prearranged 
vocabulary  desirable) 

Shouting  at 
2 - 3 ft 


* Before  this  table  Is  used,  the  comments  and  limitations 
noted  In  the  text  should  be  consulted. 
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In  Sec.  17 A we  have  seen  that  noise -Induced  hearing 
loss  Is  not  frequency  Independent.  We  should  expect,  there- 
fore, that  permanent  damage  to  hearing  depends  upon  the 
spectrum  of  the  noise  as  well  as  Its  overall  sound  pressure 
level . 


The  temporal  character  of  the  exposure  Is  another 
important  variable.  The  evidence  from  laboratory  and  field 
data  has  shown  that  the  amount  of  permanent  hearing  loss  that 
Is  incurred  Is  not  Independent  of  the  duration  of  the  exposure. 
There  may  even  be  a measurable  effect  of  Intermlttency,  l.e., 
of  the  way  in  which  the  exposure  Is  packaged  in  time.  Ex- 
posure to  a certain  noise  every  working  day  for  & lifetime 
will  produce  after-effects  that  differ  In  order  of  magnitude 
from  those'  Incurred  after  a day,  a week  and  perhaps  even  a 
year  of  exposure  to  this  same  noise . 

The  Impulsive  character  of  the  noise  Is  another  Important 
parameter  of  exposure . A noise  that  is  characterized  by  sharp 
peaks  (such  as  the  noise  In  a drop  forge  shop)  will  not  ne- 
cessarily produce  more  damage  than  a continuous  noise  with  an 
intensity  that  is  appreciably  lower  than  the  peak  Intensity 
of  the  impulsive  noise.  In  the  discussion  that  follows,  we 
shall  be  concerned  chiefly  with  noises  whose  time  character 
Is  relatively  uniform.  We  shall,  however,  cite  a case  history 
In  order  to  indicate  how  one  might  go  about  modifying  the  DR 
criterion  for  steady  noise  In  order  to  extend  Its  application 
to  a noise  of  the  more  Impulsive  type . 

Specification  of  man's  reaction  or  response  forces  us 
to  settle  upon  a method  for  determining  the  damage  to  man's 
hearing . Such  a decision  Involves  many  complex  problems . If 
we  require  that  the  exposure  stimulus  must  not  produce  signi- 
ficant permanent  damage  to  hearing,  we  must  spell  out  (l)  the 
exact  way  In  which  this  damage  Is  to  be  measured,  and  (2)  the 
amount  of  permanent  damage  that  we  are  willing  to  consider 
significant . 

In  Sec.  16.3  (Audiometry),  we  have  seen  that  damage  to 
hearing  can  be  measured  In  several  ways.  We  also  tried  to 
stress  the  fact  that  the  -different  measures  will  not  necessarily 
be  highly  correlated.  We  can,  for  Instance,  determine  the  hear- 
ing loss  for  pure  tones  at  each  of  several  frequencies.  Alter- 
nately, we  might  be  concerned  with  hearing  loss  for  speech  or 
with  discrimination  loss . We  have  seen  that  In  some  respects 
the  use  of  speech  material  as  test  stimuli  provides  the  most 
meaningful  measure  of  the  social  adequacy  of  man's  auditory 
capacity. 
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Various  formulae  have  been  suggested  (Fowler  17/ 1 
American  Medical  Association  l8A  Carter  19, 20/,  and  Fletcher 
21, 22/)  In  order  to  arrive  at  a single  overall  figure  for  a 
person's  hearing  loss.  These  formulae  weight  the  several  test 
frequencies  differently.  Their  authors  differ  also  In  the  way 
In  which  they  combine  losses  for  the  two  ears.  Currently  new 
attempts  are  under  way  to  write  medico-legal  standards  for 
compensation  resulting  from  Injury.  Both  pure  tone  audiograms 
ancT  data  on  speech  Intelligibility  will  be  considered.  However, 
for  the  present,  we  shall  base  our  criteria  upon  permanent 
threshold  shifts  for  pure  tones*. 

Several  complications  attend  the  specification  of  the 
amount  of  hearing  loss  for  pure  tones  that  we  shall  agree  to 
consider  "significant”.  In  Sec.  16.3  we  referred  to  the  various 
"normal"  audiograms  established  by  several  hearing  surveys.  We 
also  noted  the  phenomenon  of  presbycousls,  l.e.,  the  Increasing 
hearing  loss,  especially  for  the  higher  frequencies,  with  age. 

We  further  noted  that  for  the  older  age  groups  hearing  losses 
for  the  higher  frequencies  are  by  no  means  "normally" distri- 
buted (in  a statistical  sense).  There  Is  a strong  tendency 
for  the  distribution  curves  to  be  skewed  toward  the  high  loss 
end.  It  follbws  that  an  audiogram  exhibiting  perceptive  hear- 
ing loss  for  one  Individual  cannot  give  positive  Information 
on  whether  his  hearing  loss  Is  due  to  exposure  to  noise  or  to 
other  factors  such  as  Infections,  tumors,  or  old  age.  Since 
there  Is  a substantial  variation  In  hearing  loss  among  the 
population,  specification  of  the  amount  of  hearing  loss  that 
Is  to  be  considered  significant  must  be  In  statistical  terms . 

One  standard  technique  of  demonstrating  statistical 
significance  of  the  Incidence  of  hearing  loss  In  an  industrial 
population  for  Instance,  would  consist  In  showing  that  Its 
mean  hearing  loss  Is  different  than  the  mean  hearing  loss  of 
a standard  (or  control)  population  at  a level  that  Is  sta- 
tistically significant.  Such  a procedure  Is  eminently  desir- 
able but  Is  rarely  attainable  in  view  of  the  sampling  problems 
Involved.  In  practice  we  may  have  to  be  satisfied  with  tests 
that  are  statistically  less  satisfactory.  For  example,  we 
might  hope  to  show  that  the  audiograms  of  an  experimental 
population  (l.e.,  of  a population  that  has  been  exposed)  -do 


We  emphasize  that  these  criteria  are  aimed  at  per- 
ceptive (or  nerve)  deafness  only;  conductive  losses 
are  clearly  unrelated  to  after-effects  of  exposure 
to  noise . 
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or  do  not  differ  significantly  from  their  audiograms  mea- 
sured before  exposure.  On  the  other  hand  we  might,  in  the 
absence  of  exposure  data,  have  to  be  satisfied  with  a finding 
that  indicates  an  abnormally  high  percentage  of  people  in  the 
exposed  population  whose  audiograms  exhibit  hearing  losses  in 
the  higher  frequencies  of  the  order  of  20  or  25  db  (or  even 
higher) . Such  losses  would  place  the  individuals  concerned 
into  the  category  of  the  5 or  10  percent  of  the  American 
population  who  hear  most  poorly. 

The  comments  presented  above  indicate  that  an  adequate 
specification  of  a DR  criterion  is  a complicated  matter  involv- 
ing many  variables . The  kind  of  criteria  we  should  like  to  set 
up  should  be  based  upon  thorough,  long-term  studies  involving 
data  of  the  type  discussed  above . At  present  we  do  not  have 
such  data,  but  we  are  in  a situation  where  it  is  considered 
necessary  to  set  up  tentative  criteria  for  immediate  use.  We 
should,  incidentally,  be  sure  that  our  tentative  criteria  can 
provide  a reasonable  framework  for  future  validating  studies*. 


* The  above  comments  should  also  make  the  lack  of  agree- 
ment among  certain  criterion-makers  more  plausible.  It 
is  really  so  surprising  that,  in  the  absence  of  agree- 
ment on  definitions  of  exposure  and  hearing  loss.  Sterner 's 
poll  23/  of  more  than  200  acousticians,  otologists,  psy- 
chologists, industrial  physicians  and  hygienists  indicated 
wide  differences  of  opinion  on  the  question  of  a DR  cri- 
terion? Even  if  these  men  had  come  to  a preliminary 
agreement  on  definitions,  they  would  still  have  been  faced 
with  a lack  of  data.  Whatever  data  exists  hardly  satis- 
fies the  necessarily  multidimensional  requirements  of  such 
definitions.  Only  recently  have  the  various  professions 
become  aware  of  the  necessity  of  gathering  the  appropriate 
data..  Since  the  taking  of  such  data  can  not  be  rushed, 
it  will  probably  take  a few  years  before  we  will  have  the 
evidence  upon  which  to  base  more  than  a tentative  DR 
criterion. 

Since  World  War  II  the  problems  of  noise  have  claimed  the 
attention  of  many  groups.  Conferences  organized  by  the 
medical  profession  24/,  the  Noise  Abatement  Symposia  25/, 
and  the  University  of  Michigan  Training  Course  26/  testify 
to  the  popularity  of  the  subject.  However,  the  group  that 
has  done  the  most  persistent  Job  of  drawing  the  attention 
of  responsible  management  officials.  Industrial  physicians 
and  hygienists,  and  safety  engineers  to  the  problems  of 
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A Proposed  DR  Criterion  for  Steady  Noise  and  Lifetime 
Exposures . The  starting  point  for  our  suggested  criterion  is 
Kryter's  monograph  In  a discussion  of  Maximum  Safe  Inten- 
sity Level  we  find  the  following  statements:  ”A  fair,  perhaps 

conservative  evaluation  of  the  laboratory  and  industrial 
studies  on  stimulation  deafness  would  seem  to  be  that  for  long 
and  intermittent  exposures  any  frequency  of  sound  (or  narrow 
band  not  exceeding  the  critical  width)  that  is  85  db  or  less 
abo.ye  0.0002  dyne/cm^  will  not  cause  any  temporary  or  per- 
manent deafness.  The  "guess"  that  tones  85  db  above  0.0002 
dyne/cm2  may  cause  some  deafness,  either  temporary  or  permanent, 
applies  only  for  long  periods  of  exposure,  applied  intermit- 
tently over  months  or  years.  On  the  other  hand,  for  brief 
exposures  lasting  up  to  an  hour,  the  intensities  necessary  to 
cause  deafness  appear  to  be  in  the  order  of  100  db  re  0.0002 
dyne/cm2  for  any  frequency  or  critical  band."  Somewhat  fur- 
ther on  in  his  monograph  Kryter  states:  "it  is  possible,  but 

undemonstrable  with  present  data,  that  the  degree  of  deafness 
could  be  predicted  by  the  use  of  critical  band  measures  of 
noise  intensity  and  plotting  such  measures  relative  to  85  db 
above  the  threshold  for  pure  tones  in  a manner  somewhat 
similar  to  that  ...  [used]  for  calculating  speech  Intelligi- 
bility. A suitable  threshold  line  would  be  the  minimum  audible 


* noise-induced  hearing  loss  is  the  so-called  Subcommittee 
on  Noise  in  Industry.  (Its  parent  committee  is  the 
Committee  on  Conservation  of  Hearing  of  the  American 
Academy  of  Ophthalmology  and  Otolaryngology) . The  sub- 
committee has  a field  office  in  the  Los  Angeles  area;  it 
has  conducted  surveys  of  industrial  noise  and  deafness; 
it  has  been  concerned  with  problems  of  ear  protection  and 
has  worked  on  the  design  of  so-called  predictive  tests 
(tests  capable  of  predicting  on  the  basis  of  a short  ex- 
posure to  a carefully  controlled  noise  how  a given  indi- 
vidual might  react  to  prolonged  exposure  to  industrial  or 
airplane  noise,  for  Instance ) . Recently  the  American 
Standards  Association  has  set  up  a subcommittee  under  the 
name  of  "Bio-  and  Psycho-Acoustic  Criteria  for  Noise  Con- 
trol" (Z  24-X  2) . Its  first  task  is  to  explore  whether 
the  data  available  would  permit  the  writing  of  tentative 
standards,  l.e.,  criteria  in  the  area  of  noise -Induced 
hearing  loss . Finally,  mention  should  be  made  of  the  Com- 
mittee on  Noise  of  the  Acoustical  Society  of  America.  The 
committee  attempts  to  act  as  a clearing  house  for  the  al- 
most one  hundred  groups  that  are  in  various  ways  concerned 
with  the  effects  of  noise  on  man. 
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pressure  [curve]  ...  Because  of  the  greater  intensities  re- 
quired to  reach  thresholds  at  low  and  high  frequencies,  .0.0002 
dyne/cm2  would  represent  a more  conservative,  safer  reference 
point  for  calculating  the  probable  deafening  effects  of  noise 
than  the  minimum  audible  threshold;  further  experiments  are 
needed  to  determine  which  reference  level  is  more  suitable.” 

We  have  quoted  these  excerpts  in  their  entirety  since  the 
Kryter  criterion  has  been  Incorporated  into  several  other  cri- 
teria and  since  it  has  also  been  misunderstood  by  some . 

As  Justification  for  his  proposed  criterion,  Kryter  refers 
to  several  laboratory  experiments  on  the  temporary  effects  of 
exposure  to  high  Intensity  stimuli  and  to  several  field  studies 
of  hearing  loss  following  exposure  of  workers  to  Industrial 
noise.  Since  sufficiently  detailed  and  extensive  data  were 
not  available,  Kryter  was  unable  to  specify  the  criterion  in 
more  detail . 

Since  our  DR  criteria  are  designed  to  prevent  permanent 
damage  we  have  to  exclude  consideration  of  temporary  deafness. 

We  have  seen  in  Sec.  1?.^  that  practically  all  audible  stimuli 
give  rise  to  temporary  threshold  shifts . Some  of  these  pheno- 
mena last  a few  milliseconds  only;  in  others,  recovery  extends 
over  periods  of  several  hours,  days  and  perhaps  even  months. 

We  shall  be  concerned  here  with  permanent  shifts  only,  i.e., 
shifts  that  represent  irreversible  changes  in  the  organism's 
response  capacity  after  a reasonable  recovery  Interval  has 
intervened  between  the  end  of  exposure  and  the  audiogram  show- 
ing the  loss . 

Next,  we  should  like  to  aim  for  a more  precise  statement 
of  the  characteristics  of  the  exposure  both  with  respect  to 
duration  and  spectrum.  Finally,  we  feel  that  there  is  some 
necessity  for  introducing  a statement  about  the  way  in  which 
such  a criterion  can  be  calibrated  with  respect  to  hearing 
loss.  In  other  words,  we  need  to  specify  in  a statistical  way 
the  amount  of  hearing  loss  that  we  shall  consider  to  be 
significant . 

It  is  from  this  viewpoint  that  we  have  modified  and 
qualified  the  so-called  Kryter  criterion.  Our  definition  of 
hearing  loss  is  the  one  developed  in  the  first  part  of  this 
section,  with  major  emphasis  upon  the  type  of  high  frequency 
hearing  loss  that  is  potentially  a forerunner  of  more  serious 
trouble  in  terms  of  Impaired  speech  communication. 
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Kryter  hesitated  between  two  contours,  one  at  85  db 
re  0.0002  dyne/cm2  and  the  other  at  85  db  sensation  level 
(presumably  above  the  binaural  MAF  threshold).  In  the  light 
of  the  available  evidence  we  took  a compromise  position  between 
the  two  contours . We  did  not  feel  that  we  could  go  as  high  as 
85  db  sensation  level  at  the  low  frequency  end,  this  would  have 
meant  approximately  125  and  l40  db  SPL  In  the  frequency  bands 
75-100  and  20-75  cps  respectively.  However,  evidence  on 
exposure  to  low  frequency  noise  (such  as  that  obtained  In  the 
cabins  of  conventional  aircraft)  seems  to  Indicate  that  noises 
In  which  the  energy  Is  concentrated  below  300  cps  are  less 
effective  In  producing  permanent  hearing  loss  than  stimuli  of 
equal  SPL  In  the  middle  range  of  frequencies  around,  say, 

1200  cps.  Hence  the  upward  bend  In  our  criterion  curve  for 
frequencies  below  5OO  cps,  as  we  shall  see  In  Pig.  I8.6. 

At  high  frequencies  there  Is  a range  In  which  the  85  db 
sensation  level  contour  represents  the  more  conservative  posi- 
tion than  the  85  db  SPL  contour.  In  this  region  the  binaural 
MAF  threshold  falls  below  0 db  SPL  because  of  diffraction 
effects  around  the  head  and  resonances  In  the  outer  ear.  We 
have  again  made  a compromise  on  the  conservative  side . With- 
out dropping  as  low  as  85  db  sensation  level,  we  have  taken 
some  account  of  the  evidence  that  seems  to  Indicate  particular 
sensitivity  to  hearing  loss  In  this  frequency  region.  Thus 
our  criterion  In  the  region  between  1200  and  4800  cps  slopes 
slightly  below  85  db  SPL.  Little  Importance  should  be  attached 
to  the  frequency  region  above  4800  cps  In  view  of  the  diffi- 
culties of  measuring  thresholds  of  audibility  for  such  fre- 
quencies and  the  relatively  small  Importance  of  these  frequencies 
In  the  communicative  process . 

Our  discussion  of  frequency  characteristics  has  applied 
to  pure  tones  or  to  stimuli  for  which  a major  portion  of  the 
energy  Is  concentrated  In  bands  that  are  narrower  than  the 
critical  width  (see  Sec.  l6.2).  In  order  to  specify  a cri- 
terion for  wide  band  noise,  we  must  make  an  assumption  regard- 
ing the  way  In  which  the  effects  of  exposure  stimuli  at 
different  frequencies  combine.  Kryter 's  criterion  stated  that 
the  level  of  the  noise  must  not  exceed  certain  specified  levels 
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(85  dt>  SPL  modified  as  indicated  above)  in  any  critical  band*. 
The  implication  is  that  critical  bands  contribute  inde- 
pendently to  hearing  loss . Unfortunately  this  assumption  can- 
not be  Justified  on  the  basis  of  present  laboratory  and  field 
data  (see  discussion  in  Sec.  17.^).  As  a matter  of  fact  the 
few  data  collected  by  Davis  and  others  during  World  War  II 
(for  levels  around  125  db  SPL)  showed  octave  bands  of  noise 
and  pure  tones  within  the  octave  to  be  equally  effective  in 
producing  hearing  loss,  provided  they  had  the  same  SPL. 

Since  most  of  our  measuring  Instruments  read  octave -band 
levels,  we  must  translate  our  critical  band  specifications 
into  octave  band  specifications.  Thus  if  the  criterion  spe- 
cifies 85  db  SPL  in  a critical  band  Af^,  it  should  specify 

Af 

85  + 10  log  db 

^ c 

in  an  octave  band  APq,  if  we  maintain  our  assumption  of  the 
Independent  contribution  of  critical  bands  to  hearing  loss. 

The  width  of  the  critical  band  as  a function  of  frequency  has 
already  been  shown  in  Fig.  16.19- 

Finally  we  come  to  the  problem  of  the  duration  of  the 
exposure.  The  contours  that  we  suggest  are  Intended  to  repre- 
sent safe  levels  in  terms  of  exposure  in  a lifetime  of  work. 

At  this  stage  we  are  hesitant  to  modify  the  criterion  levels 
toward  higher  values  for  exposures  that  are  less  severe,  but 
that  still  last  over  a period  of  months . For  very  short 
exposures,  however,  we  shall  indicate  that  higher  levels  can 
be  tolerated. 


* Kryter's  comments  on  the  question  of  critical  bands 
are  as  follows:  "it  has  been  assumed  by  Fletcher 

and  others  that  the  concept  of  critical  band  (the 
narrowest  band  around  any  frequency  which  just  masks 
that  frequency  when  of  equal  over-all  intensity) 
should  cover  not  only  masking  phenomena  but  also  the 
deafening  effect  of  noise . The  assumption  is  that 
since  the  addition  of  sound  frequencies  beyond  the 
critical  bands  does  not  contribute  to  the  masking 
effectiveness  of  the  noise  within  the  critical  band, 
these  additional  frequencies  likewise  contribute 
nothing  to  the  deafening  effect  of  the  noise  within 
the  critical  band" . 
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In  summary,  we  show  In  Pig.  l8.6  the  SPL  contours  for 
octave  hands  and  for  critical  hands  that  define  our  DR  criterion. 
The  following  qualifications  should  he  applied  when  this  DR  cri- 
terion Is  used: 

1.  These  contours  are  not  to  he  taken  too  literally 
since  deviations  of  the  order  of  1 or  2 dh  In  either  direction 
could  prohahly  he  disregarded.  Contours  such  as  these  should 
he  Interpreted  as  zones  with  some  uncertainty  attending  the 
measurement  of  the  exposure  stimulus,  and  hiologlcal  varia- 
bility modifying  the  probability  of  damage.  We  feel,  however, 
that  cdntours  10  dh  lower  would  Involve  negligible  risks 
indeed,  while  contours  10  dh  higher  would  result  In  significant 
Increases  In  hearing  loss . 

2.  The  levels  are  considered  to  be  safe  In  terms  of 
exposures  during  working  days  for  durations  up  to  a lifetime . 

3.  The  criterion  levels  apply  to  exposure  noise  that 
has  a reasonably  continuous  time  character  with  no  substantial 
sharp  energy  peaks . 

4.  For  wide-hand  noise,  the  curve  designated  "octave 
hands"  should  be  used.  For  pure  tones  or  for  noise  in  which 
the  major  portion  of  the  energy  Is  concentrated  In  a hand 
narrower  than  the  critical  band,  the  curve  designated  "critical 
bands"  should  be  used.  In  the  latter  case,  the  abscissa  should 
be  Interpreted  as  a logarithmic  frequency  scale  rather  than  a 
scale  of  octave  bands  of  frequency. 

5.  The  criterion  should  be  considered  as  tentative  only, 
and  Is  subject  to  further  revision  as  new  laboratory  and  field 
data  are  reported. 

Criteria  for  Short  Exposures . For  short  and  very 
Intermittent  exposures  of  the  order  of  minutes  In  duration, 
higher  levels  than  those  shown  in  Pig.  18.6  may  be  tolerated. 


Figure  l8.6 

Damage  risk  (DR)  criterion  for  steady  noise  and 
for  lifetime  exposures . The  upper  curve  applies 
to  situations  In  which  the  noise  Is  wlde-band; 
the  lower  curve  applies  to  pure  tones  or  to  bands 
of  noise  of  critical  width  or  less.  For  an  Inter- 
pretation of  these  curves.  Including  their  sta- 
tistical character,  see  text. 
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Frequency  Band  cps 


There  is  some  evidence  from  animal  experimentation  that  there 
Is  an  upper  limit  in  the  neighborhood  of  150  db  27/.  Stimuli 
of  that  intensity  seems  (at  least  in  anesthetized  guinea  pigs) 
to  produce  irreversible  damage  at  the  end  of  time  intervals 
of  the  order  of  one  -minute . There  are  Isolated  cases  in  which 
individuals  have  been  exposed  to  reliably  measured  levels 
higher  than  15O  db  SPL.  The  effects  (see  Sec.  17.^)  have  been 
of  a "prolonged  temporary"  nature,  and  might  have  produced 
Irreparable  damage  in  other  individuals.  We  recommend,  there- 
fore, that  even  for  rather  short  exposures  of  the  order  of  one 
minute  or  so,  levels  of  l45  db  to  150  db  should  not  be  exceeded. 
No  person  without  well-fitting  ear  protection  should  really  be 
exposed  to  levels  higher  than  140  db  for  any  appreciable 
length  of  time.  At  such  high  intensities,  the  overall  level 
of  the  noise  is  probably  the  best  measure  of  the  exposure 
stimulus,  since  the  effects  seem  to  be  more  or  less  inde- 
pendent of  frequency.  Let  us  also  not  forget  that  at  levels 
in  the  vicinity  of  15O  db  SPL  we  may  have  to  supplement  our 
criteria  designed  to  prevent  damage  to  hearing  by  criteria 
based  on  performance  of  the  organism  in  the  non-audltory  sphere. 

Example  of  a DR  Criterion  for  Impulsive  Noise.  We  pre- 
sent here  an  example  of  a DR  criterion  for  noise  that  is  not 
continuous  in  time  but  is  of  an  impulsive  character.  The 
criterion  was  proposed  specifically  for  the  protection  of 
operators  in  a drop  forge  shop  from  the  noise  of  their  drop 
hammers.  It  is  presented  here  to  indicate  some  of  the  factors 
that  must  be  considered  when  we  try  to  modify  the  DR  criterion 
for  steady  noise  and  apply  it  to  impulsive  noise . 

The  physical  characteristics  of  the  sound  from  a drop 
forge  have  been  described  in  Sec.  7.4,  Figs.  7.10,  7.11  and 
7.12.  The  stimulus  is  characterized  by  large  peaks  of  energy 
of  a few  milliseconds  duration.  The  sound  from  any  one  drop 
forge  in  this  particular  case  is  repeated  at  a rate  not  exceed- 
ing one  per  second.  Thus  the  average  SPL  from  a drop  forge  is 
much  lower  (about  20  db)  than  the  peak  SPL.  It  is  difficult, 
therefore,  to  obtain  numbers  that  adequately  describe  the  noise 
in  a simple  fashion. 

Information  on  temporary  and  permanent  effects  of 
exposure  to  short  tones  and  noises  has  been  reported  in  the 
literature  (see  Sec.  17.4)  and  a reasonable  estimate  of  a suit- 
able damage  risk  criterion  can  be  made  on  the  basis  of  these 
data.  Certain  relevant  data  are  summarized  in  the  following 
paragraphs . 
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Relatively  brief  exposures  of  a few  hundred  milli- 
seconds to  tones  at  levels  considerably  below  the  level  of 
the  DR  criterion  for  steady  sounds  are  followed  by  temporary 
upward  shifts  of  the  threshold  of  hearing  for  tones  of  the 
exposure  frequency  and  of  neighboring  frequencies.  The  magni- 
tude and  duration  of  this  temporary  shift  or  "fatigue*” 
Increases  as  the  Intensity  of  the  exposure  time  Is  Increased. 
When  the  level  of  the  exposure  tone  becomes  greater  than  a 
critical  value  In  the  range  90-100  db,  the  Increase  In  fatigue 
becomes  quite  sharp.  Further  data  show  that  the  short-time 
auditory  fatigue  Is  Independent  of  the  duration  of  exposure 
for  sounds  of  low  Intensity  level,  over  a range  from  l/lO 
second  to  10  seconds . As  the  level  of  the  exposure  time  Is 
raised  to  about  80  or  90  db,  however,  the  Induced  hearing  loss 
Increases  with  Increased  duration  of  exposure . Unfortunately, 
the  levels  Investigated  did  not.  In  general,  exceed  the  range 
9O-IOO  db  for  the  middle  and  high  frequencies . 

These  data  on  short-time  auditory  fatigue  Indicate  that 
a drop  forge  operator  must  undergo  considerable  temporary  hear- 
ing loss  after  each  hammer  Impact.  The  temporary  loss  rapidly 
decreases  within  one  or  two  seconds.  However,  the  temporary 
loss  may  not  decrease  to  zero  before  another  hammer  Impact 
occurs . 

There  Is  also  evidence  to  show  that  short  exposures 
(up  to,  say,  several  hours  continuously)  to  sounds  like  the 
Impact  sounds  from  the  drop  hammers  give  rise  to  temporary  hear- 
ing losses  that  persist  for  a few  hours  or  days.  Typical 
examples  of  the  effects  of  Intermittent  noise  are  hearing  losses 
reported  In  habitual  hunters  and  In  gunnery  Instructors . 

As  noted  above  In  the  discussion  of  criteria  for  short 
exposures,  there  Is  apparently  an  upper  limit  of  Intensity 
capable  of  producing  damage  to  the  hearing  mechanism  no  mat- 
ter how  short  the  duration  of  the  physical  exposure.  Prom  the 
small  amount  of  evidence  available  at  present  27, 28/,  this 
limit  will  be  placed  tentatively  between  150  and  I60  db  sound 
pressure  level  re  0.0002  dyne/cm2.  The  peak  levels  from  the 
drop  hammers  considered  here  are  always  less  than  I50  db  at 
the  operator's  ear  position,  and  consequently  we  shall  assume 
that  there  Is  no  permanent  damage  from  only  a single  Impact 
sound. 


See  Sec.  11 .k 
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The  damage  risk  criterion  previously  determined  for 
steady  noise  will  be  used  as  a basis  for  the  establishment  of 
upper  and  lower  limits  of  peak  sound  pressure  level,  between 
which  the  damage  risk  criterion  for  Impact  noise, must  lie. 

The  criterion  for  steady  noise  places  an  upper  limit  of 
about  95  dl>  on  the  sound  pressure  level  In  each  octave  band 
between  300  and  10,000  cps,  to  Insure  reasonable  freedom  from 
damage.  Impact  sounds  whose  peak  sound  pressure  levels  do  not 
exceed  95  db  In  these  octave  bands  are  therefore  assumed  to 
cause  no  permanent  damage  to  hearing,  even  when  they  are  re- 
peated many  times. 

An  upper  limit  beyond  which  there  Is  a definite  risk  of 
damage  may  be  determined  as  follows . The  average  sound  Inten- 
sity (and  hence  the  average  sound  Intensity  level)  to  which 
the  operator  Is  exposed  Is  computed  for  each  octave  band, 
assuming  fluctuations  In  sound  pressure  similar  to  those  de- 
picted In  Pig.  7.12.  If  the  levels  determined  In  this  manner 
are  greater  than  the  DR  criterion  level  for  steady  noise, 
there  Is  definite  risk  of  damage.  For  the  type  of  drop  ham- 
mers considered  here,  an  average  SPL  of  95  db  would  correspond 
to  a peak  SPL  of  about  115  db.  In  the  octave  bands  above 
300  cps . 

It  Is  almost  certain  that  the  best  criterion  lies 
between  the  two  criteria  discussed  above.  That  Is,  a cri- 
terion specifying  maximum  peak  levels  of  95  db  per  octave  band 
above  300  cps  would  be  too  stringent,  whereas  average  octave 
band  levels  of  95  db  would  not  afford  sufficient  protection. 

Using  known  experimental  data  on  short-time  auditory 
fatigue  29/,  a rough  extrapolated  estimate  can  be  made  of  the 
amount  of  cumulative  hearing  loss  Induced  by  tone  bursts 
repeated  at  a rate  of  one  per  second,  and  having  a level  of 
about  105  db.  The  data  Indicate  that  the  effect  of  the  first 
sound  would  have  practically  worn  off  by  the  time  the  second 
one  Is  delivered.  One  must  be  very  careful  when  one  Is  trying 
to  make  such  extrapolations,  since  It  Is  by  no  means  certain 
that  the  organism  behaves  like  mechanisms  that  obey  the  super- 
position principle  for  repeated  exposures . However,  we  may 
at  least  conclude  that  the  order  of  magnitude  of  our  sug- 
gested criterion  is  approximately  correct. 

Our  tentative  criterion  for  Impulsive  sounds  having 
the  character  (l.e.  time-characteristics,  spectra,  etc.)  of 
those  encountered  In  the  drop  forge  shop  can  then  be 
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formulated  as  follows;  the  peak  sound  pressure  level  within 
an  octave  band  must  be  no  more  than  105  db,  for  the  range 
from  300  cps  to  10,000  cps . This  criterion  yields  an  overall 
peak  sound  pressure  level  of  about  112  db  for  the  average  drop 
forge  spectrum. 

18.5  Miscellaneous  Noise  Criteria  for  Various  Environments 


The  criteria  for  noise  control  that  we  have  discussed 
in  the  previous  three  sections  have  been  based  on  certain 
specified  types  of  behavior  on  the  part  of  the  Individual  or 
individuals  who  are  exposed  to  the  noise . The'  three  types  of 
behavior  are:  (l)  Interference  with  residential  living, 

(2)  Interference  with  speech  communication,  and  (3)  loss  or 
partial  loss  of  auditory  function  as  a result  of  exposure  to 
high  Intensity  noise . Occasionally  the  type  of  behavior  that 
Is  involved  may  not  be  covered  by  these  three  categories,  or 
may  Include  more  than  one  category.  Several  such  situations 
are  discussed  In  this  section. 

Office  Spaces . Noise  In  office  spaces  may  Interfere 
with  several  types  of  activity.  It  Influences  the  ability 
of  the  occupants  to  talk  to  each  other  either  directly  or  on 
the  telephone;  It  may  be  responsible  for  or  associated  with 
fatigue,  annoyance  and  irritability.  It  may  indirectly  (via 
speech  communication,  for  example)  affect  the  efficiency  of 
the  workers . Thus  noise  In  office  spaces  does  not  only  In- 
fluence behavior  but  does  so  In  a rather  complex  fashion. 

In  an  effort  to  evaluate  the  noise  levels  that  are 
acceptable  to  people  In  office  spaces,  Beranek  and  Newman  7/ 
made  a survey  in  three  large  organizations : a metal  producing 

company,  a radio  equipment  factory,  and  an  educational  institu- 
tion. They  made  physical  measurements  of  the  noise,  and  also 
measured  people's  reactions  by  means  of  a questionnaire.  The 
questionnaire  Included  the  following  questions: 

(a)  Rate  the  noisiness  of  this  room  by  placing  an  "x" 
along  the  scale  below: 


////// 

very  quiet  moderately  noisy  very  intolerably 

quiet  noisy  noisy  loud 
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(b)  Rate  your  ability  to  use  the  telephone  under  the 
condition  of  noisiness  for  (a)  by  placing  an  "x”  along  the 
scale  below: 


L L L L 

satisfactory  slightly  difficult  impossible 
difficult 

(c)  On  the  same  scale  as  for  (a)  check  the  noise  rating 
that  you  feel  should  not  be  exceeded  if  you  are  to  accomplish 
your  duties  without  loss  of  performance. 

(d)  At  what  maximum  distance  in  feet  should  you  be  able 
to  converse  with  another  person  in  your  room  in  order  that  you 
may  do  your  duties  without  loss  of  performance? 

(e)  In  order  that  your  performance  not  be  affected-,  do 
you  consider  that  at  the  distance  Indicated  in  the  previous 
question  you  should  be  able  to  talk  in  a: 

L L L 

normal  slightly  loud 

(quiet)  voice  raised  voice  voice 

At  all  pertinent  locations  in  the  office  spaces,  the 
sound  pressure  levels  in  octave  bands  of  frequency  were  mea- 
sured, and  the  speech  Interference  levels  were  computed.  Since 
Interference  with  speech  was  believed  to  be  the  most  Important 
effect  of  the  noise,  it  was  concluded  that  the  SIL  would  provide 
a meaningful  single  number  that  described  the  noise  environ- 
ment . 


The  average  answers  to  questions  (a)  and  (b)  are  shown 
in  Pig.  18.7  as  two  curves.  These  curves  are  based  on  answers 
to  100  questionnaires.  As  the  curves  indicate,  the  office 
spaces  were  divided  into  two  categories:  (l)  private  offices. 


Figure  18.7 

Subjective  rating  of  noises  in  offices.  The 
curves  summarize  in  graphical  form  the  results  of 
100  questionnaires  Issued  during  a survey  of  noise 
in  office  spaces.  See  text.  (After  Beranek  auid 
Newman  7/) . 
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Telephone  use  satisfactory 


o 


3uf4Ba  OAT^oafqng 
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Speech  Interference  level  In  db 


Sound  pressure  level  In  band  - db  re  0.0002  dyne/cm 


CM 


20  150  600  2400 

75  300  1200  4800 

Frequency  band  In  cps 


Figure  18.8 

Graphical  relation  between  Judgment  of  comfort 
inside  the  cabins  of  commercial  transport  air- 
craft and  the  octave-band  pressure  levels . The 
six  regions  between  the  curves  are  defined  as 
follows;  (1)  Ideally  quiet,  (2)  comfortable, 

(3)  quasi-comfortable,  (4)  uncomfortable, 

(5)  very  uncomfortable,  (6)  intolerable.  (After 
Lippert  and  Miller  I7/) . 
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and  (2)  secretarial  and  drafting  offices  and  rooms  with 
business  machines.  The  average  answers  to  question  (c)  are 
shown  as  points  A and  B on  the  two  curves  of  Pig.  I8.7.  The 
average  answer  to  (d)  and  (e)  for  private  offices  was  "normal 
voice  at  9 feet,"  and  the  answer  for  secretarial  offices  was 
"slightly  raised  voice  at  3 feet." 

The  SIL's  of  40  db  for  private  offices  and  55  db  for 
secretarial  offices  can  be  used,  therefore,  as  approximate 
criteria  for  noise  control  In  these  spaces  (see  Table  l8.7). 

Cabins  In  Commercial  Aircraft . Llppert  and  Miller  30/ 
have  studied  the  problem  of  acceptable  noise  levels  Inside 
commercial  transport  aircraft  with  reciprocating  engines . 

Prom  their  experience,  they  Indicate  ranges  of  octave  band 
levels  that  would  be  classified  as  (l)  Ideally  quiet,  (2)  com 
fortable,  (3)  quasi -comfortable,  (4)  uncomfortable,  (5)  very 
uncomfortable  and  (6)  Intolerable*.  Unfortunately  no  evi- 
dence Is  supplied  that  would  permit  us  to  estimate  to  what 
extent  a representative  sample  of  passengers  would  agree  with 
these  Judgments.  Their  classification  Is  shown  graphically 
In  Pig.  18.8.  Curve  A Is  considered  as  defining  the  upper 
limit  of  comfort  for  an  airplane  noise  spectrum.  It  Is  of 
Interest  to  note  that  a noise  defined  by  Curve  A would  have 
a SIL  of  77  db,  and  would,  according  to  Table  18.5j  permit 
conversation  In  a very  loud  voice  at  one  foot . The  region 
defined  as  "comfortable"  would,  by  the  same  token,  define  an 
environment  In  which  conversation  Is  possible  In  a raised 
voice  at  three  feet. 

Typical  Criteria  for  Other  Spaces . Typical  criteria 
for  other  spaces  have  been  suggested  15/  In  terms  of  the  maxi 
mum  permissible  SIL  measured  when  the  room  Is  not  In  use. 


One  should  remember,  however,  that  there  Is  always 
a certain  amount  of  vibration  In  aircraft,  and  that 
man's  reaction,  especially  at  higher  levels,  may  be 
a composite  of  his  reaction  to  noise  and  to  vibration. 
Had  the  authors  described  In  more  detail  the  method 
by  which  their  classification  scheme  was  obtained  It 
would  be  easier  to  evaluate  the  contribution  that 
vibration  makes  to  the  concept  of  the  "Acoustical 
Comfort  Index . " 
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These  criteria  are  given  In  Table  18.8.  As  noted  In  Sec.  18. 3^ 
specification  of  the  SIL  alone  does  not  define  the  permissible 
levels  at  low  frequencies.  In  the  absence  of  empirical  vali- 
dation, It  Is  suggested  that  the  maximum  permissible  levels  at 
low  frequencies  be  those  defined  by  the  curves  In  Fig.  l8.5. 

Although  the  criteria  In  Table  l8.8  are  specified  In 
terms  of  maximum  permissible  SIL,  It  Is  reasonably  clear  that 
speech  communication  Is  not  the  only  human  function  that  Is 
affected  by  the  noise.  Questions  of  sleep,  comfort  and  annoy- 
ance are  also  Involved  to  a degree  that  cannot  easily  be 
quantified  at  present . 


TABLE  18.8 

CRITERIA  FOR  CONTROL  OF  BACKGROUND  NOISE  IN  VARIOUS  SPACES 

Type  of  Room  Msuclmum  Permissible  SIL 

(measured  when  room  Is 


not  In  use) 

Small  private  office 

40 

Conference  room  for  20 

30 

Conference  room  for  50 

25 

Movie  theatre 

30 

Theatres  for  drama 

(500  seats,  no  amplification) 

25 

Coliseum  for  sports  only 
(amplification) 

50 

Concert  halls  (no  amplification) 

20 

Secretarial  offices  (typing) 

55 

Homes  (sleeping  areas) 

25 

Assembly  halls  (no  amplification) 

25 

School  rooms 

25 
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On  the  other  hand,  Knudsen  and  Harris  16/  have  sug- 
gested in  the  following  table  recommended  noise  levels  for 
unoccupied  rooms. 


TABLE  18.9  1^ 

RECOMMENDED  ACCEPTABLE  AVERAGE  NOISE  LEVELS  IN 
UNOCCUPIED  ROOMS 


Type  of  Space 

Sound  Level* 
decibels 

Radio,  recording  and  television  studios 

25-30 

Music  rooms 

30-35 

Legitimate  theatres 

30-35 

Hospitals 

35-40 

Motion  picture  theatres,  auditoriums 

35-40 

Churches 

35-40 

Apartments,  hotels,  homes 

35-45 

Classrooms,  lecture  rooms 

35-40 

Conference  rooms,  small  offices 

40-45 

Court  rooms 

40-45 

Private  offices 

40-45 

Libraries 

40-45 

Large  public  offices,  banks,  stores,  etc. 

45-55 

Restaurants 

50-55 

The  levels  given  In  this  table  are  "weighted";  that 
Is,  they  are  the  levels  measured  with  a standard 
sound-level  Incorporating  a 40-db  frequency-weight- 
ing network  15/. 
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vibration.  At  the  present  time,  the  literature  con- 
tains little  beyond  laboratory  studies  In  the  area  of  acceptable 
levels  of  mechanical  vibration  31,32/.  As  high  sound  fields 
produced  by  Jet  engines  with  afterburners  become  more  prevalent. 
It  can  be  safely  predicted  that  empirical  studies  will  be  under- 
taken that  will  lead  to  the  establishment  of  at  least  approxi- 
mate criteria. 

Present  evidence  Indicates  that  the  presence  of 
vibratory  phenomena  In  connection  with  the  passing  of  aircraft 
(rattling  of  windows  and  dishes)  Increases  the  probability  of 
complaints  in  residential  areas.  As  better  data  on  such 
Incidents  becomes  available.  It  might  prove  necessary  to  take 
account  of  vibratory  phenomena  In  the  computation  of  the 
composite  noise  rating.  (See  Sec.  l8.2). 
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APPENDIX  1 


The  Ear;  A Little  Anatomy  and  Some  Facts  about  the  Auditory 

Nervous  System 

Some  of  you  may  have  wondered  why  we  have  not  started 
the  discussion  of  hearing  by  giving  a description  of  the 
structures  of  the  ear  and  of  the  organization  of  the  auditory 
system.  The  ability  of  a listener  to  distinguish  various 
frequencies  has  often  been  explained  by  assuming  the  presence 
of  resonators  In  the  Inner  ear  and  you  may  have  heard  of  the 
battle  between  the  resonance  theorists  (Helmholtz  ^/)  and  the 
so-called  telephone  theorists  (Rutherford  2/) . 

This  handbook  Is  obviously  not  the  place  to  compare 
critically  the  merits  of  various  theories  of  hearing*.  It 
might,  however,  be  quite  appropriate  to  discuss  the  ear's 
anatomy  (or  at  least  certain  of  Its  anatomical  features).  We 
might  also  want  to  review  the  numerous  papers  dealing  with  the 
electrical  signals  from  the  auditory  nervous  system,  provided 
such  discussions  are  capable  of  throwing  some  light  upon  the 
effects  of  noise  on  man.  However,  unfortunately  there  are  only 
a few  facts  that  have  a direct  bearing  upon  our  problems  at 
hand.  We  shall,  therefore,  limit  our  discussion  In  this  appen- 
dix to  a general  outline  of  certain  anatomical  and  physiological 
facts  with  emphasis  upon  those  considerations  that  are  relevant 
to  a discussion  of  the  effects  of  noise. 

The  human  ear  (see  Pig.  Al.l)  Is  commonly  described  as 
consisting  of  three  main  parts;  the  outer  ear  (includes  every- 
thing up  to  the  eardrum) , the  air-filled  middle  ear  cavity 
which  contains  the  three  ossicles,  and  finally  the  Inner  ear 


* Actually  most  of  these  theories  are  directed  toward 
the  phenomena  of  pitch  perception,  with  minor  empha- 
sis placed  on  such  aspects  of  auditory  behavior  as 
loudness  and  masking.  No  serious  overall  attempt  has 
been  made  to  develop  a comprehensive  theory  ranging 
from  speech  perception  to  the  after-effects  of  various 
sounds.  It  Is  undoubtedly  premature  for  such  an  under- 
taking, but  as  the  Interaction  between  man  and  noise 
becomes  more  Important  It  Is  to  be  expected  that 
theorists  will  want  to  turn  their  attention  toward 
these  neglected  aspects  of  man's  responses  to  sounds. 
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consisting  essentially  of  the  cochlea  and  the  vestibular 
apparatus.  The  vestibular  apparatus  Is  that  part  of  the 
anatomy  that  Is  Involved  mainly  In  maintaining  equilibrium. 

Let  us  now  see  how  these  parts  of  the  ear's  anatomy  Influence 
the  behavior  of  an  Individual  In  his  responses  to  sound. 

The  outer  ear  makes  its  Influence  felt  when  auditory 
thresholds  are  measured.  The  length  of  the  ear  canal  up  to 
the  eardrum  Is  approximately  one  Inch  long.  This  corresponds 
roughly  to  a quarter  wavelength  in  air  for  a 3000  cps  sound. 

This  tuning  accounts  for  a significant  part  of  the  difference 
between  minimum  audible  field  data  and  minimum  audible  pressure 
data  In  the  frequency  region  around  3000  cps  (see  Sec.  l6.2). 
Otherwise,  the  ear  channel  exhibits  In  Its  anatomical  details 
great  variability' from  man  to  man.  This  Is  a fact  which  we 
must  keep  In  mlr^d  when  we  consider  the  wearing  of  ear  plugs 
In  order  to  keep  unwanted  sound  from  reaching  the  middle  and 
Inner  ear. 

The  role  of  the  three  ossicles,  named  hammer  (malleus), 
anvil  (incus),  and  stirrup  (stapes).  Is  a fairly  Intricate  one. 
There  Is  some  evidence  that  these  three  smallest  bones  In  the 
humaui  body  act  as  an  Impedance  transformer  matching  the  acoustic 
Impedance  of  air  to  the  Impedance  of  the  fluid  In  the  Inner  ear. 
Bekesy  has  established  that  these  ossicles  have  at  least  two 
stable  modes  in  which  they  can  vibrate,  as  shown  In  Fig.  A1.2. 

The  transition  from  one  mode  to  the  other  occurs  for  certain 
displacement  amplitudes  which  exceed  a certain  critical  value*. 
For  hlgh-intenslty,  low-frequency  sounds  an  Increase  In 
stimulus  Intensity  Is  accompanied  by  a decrease  In  the  reported 
loudness.  There  Is  also  some  evidence  that  the  pain  or  prick- 
ling sensation  reported  by  observers  In  the  presence  of  very 
intense  sounds  may  be  related  to  a rubbing  of  one  of  the  ossicles 
against  the  tissues  that  cover  the  Inside  of  the  middle  ear 
cavity.  The  middle  ear  acts  further  as  a protective  device; 
there  are  two  small  muscles  In  the  middle  ear,  the  stapedius 
muscle  and  the  tensor  tympanl . While  their  action  Is  not  en- 
tirely understood.  It  Is  reasonably  certain  that  their  contrac- 
tion reduces  the  transmission  of  low  frequency  sounds  but  has 


* Under  this  circumstance,  the  amplitude  of  oscilla- 
tion transmitted  to  the  Inner  ear  Is  less  than  the 
amplitude  that  would  be  otherwise  transmitted  If  the 
ossicles  continued  to  vibrate  In  their  principal  mode. 
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Figure  A1 . 1 

Schematic  drawing  of  the  human  ear.  Sound  waves  enter  the 
external  meatus,  and  move  the  tympanic  membrane  which  sets 
the  three  ossicles  in  motion.  When  the  stapes  footplate  moves 
Inward,  the  perilymph  inside  the  cochlea  flows  In  the  dlrectlor 
of  the  helicotrema  and  makes  the  round-window  membrane  bulge 
outward.  All  these  movements  can  be  observed  with  the  aid  of 
a microscope . (From  B^kesy  and  Rosenbllth  and  Bekesy  . 
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Figure  A1 . 2 

The  two  modes  of  vlbratlonf  of  the  stapes.  (A)  Rotation  of 
the  footplate  about  a vertical  axis  through  its  far  edge  occurs 
in  response  to  weak  sounds.  The  fluid  is  pressed  into  the 
scala  vestlbull  by  the  motion  of  the  edge  of  the  stapes  nearest 
the  reader.  (B)  For  intensities  above  the  threshold  of  feel- 
ing, the  axis  of  rotation  coincides  with  the  longitudinal  axis 
of  the  footplate,  and  the  actual  volume  of  the  fluid  pressed 
into  the  scala  is  reduced.  (From  B^kesy  and  Rosenblith 
and  Bekisy  . 
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relatively  little  effect  In  the  frequency  range  above  1000  cps. 
Also,  It  takes  time  for  the  muscles  to  respond  to  high  Intensity 
sound.  It  takes  the  reflex  about  l/lOO  of  a second  to  become 
effective.  In  other  words,  there  Is  enough  time  for  quite  a 
few  high  Intensity  peaks  to  reach  the  Inner  ear  before  the 
muscles  become  effective . While  the  muscles  may  provide  a 
reasonable  amount  of  protection  for  low  frequency  steady  sounds, 
they  are  rather  Ineffective  against  sudden  noises . There  Is  no 
Information  on  the  way  In  which  the  muscles  behave  In  the  pre- 
sence of  waveforms  that  exhibit  repeated  peaks . 

The  Inner  ear  contains  a multitude  of  cavities  and 
channels  whose  shapes  are  sufficiently  complex  to  be  known  under 
the  technical  name  of  the  labyrinth.  We  shall,  therefore, 
attempt  to  give  only  a rather  simplified  description  of  details 
of  the  cochlea,  whose  role  In  the  process  of  hearing  In  mammals 
and  birds  Is  clearly  established.  In  man,  this  snail-like  coll 
(hence  the  name  cochlea,  which  means  snail  In  Greek)  Is  a rather 
flat  spiral  of  about  two  and  one-half  turns.  The  cochlea  Is 
subdivided  Into  three  channels,  two  of  which  are  filled  with 
liquid  (see  Fig,  AI.3).  The  third  channel,  the  cochlear  parti- 
tion that  separates  the  upper  from  the  lower  gallery.  Is  made 
up  of  several  fibrous  flexible  membranes  Including  the  basilar 
membrane.  It  contains  also  the  so-called  hair  cells,  their 
supporting  structures  such  as  the  organ  of  Cortl,  and  the 
tectorial  membrane . The  remainder  of  the  space  Is  filled  with 
a highly  viscous  liquid  called  endolymph.  This  middle  channel 
contains  In  addition  the  nerve  fibers  that  Join  to  make  up  the 
auditory  nerve . 

In  the  last  decade  our  knowledge  concerning  the  mechanical 
events  Inside  the  cochlea  has  advanced  greatly.  Practically 
all  the  Information  has  come  from  the  work  of  a single  man  - 
B^k^sy*,  whose  technique  Is  sufficiently  refined  to  enable  him 
to  make  measurements  Inside  the  cochlea  without  destroying  It 
or  without  seriously  Interfering  with  Its  functioning.  We  are 
now  not  only  able  to  specify  the  elastic  properties  of  the 
structures  that  make  up  the  cochlea  (see  reference  3/,  Pig.  27) 
but  we  can  also  state  what  the  pattern  of  vibration  along  the 
cochlear  partition  will  be  when  the  ear  Is  stimulated  by  tones 
between  50  and  about  3000  cps  (see  Pig.  A1.4). 


For  an  extensive  bibliography  of  the  papers  of 

G.  V.  Bekesy  see  the  references  at  the  end  of  Chapter 

27  of  the  Handbook  of  Experimental  Psychology. 
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We  thus  know  that  we  cannot  account  for  the  fine  dis- 
criminations In  pitch  of  which  men  and  animals  are  capable  on 
the  basis  of  the  resonance  of  the  simple  tuned  elements  that 
were  supposed  to  make-  up  the  basilar  membrane . The  resonance 
patterns  along  the  *cochlear  partition  are  rather  broad  and  the 
Implication  Is  that  we  have  to  look  for  some  mathematical  trans- 
formation or  other  mechanism  that  would  produce  a considerable 
amount  of  sharpening.  If  we  wanted  to  account  for  man's 
discriminatory  ability,  B^kesy's  work  (and  the  work  of  others 
who  have  dealt  essentially  with  electrical  aspects  of  the  ear's 
behavior  In  response  to  sounds)  makes  It  obvious  that  the  time 
Is  past  when  It  Is  safe  to  consider  an  organism's  behavior  In 
response  to  an  acoustic  stimulus  and  say,  "this  Is  what  the 
ear  does".  It  was  precisely  this  feeling  that  prompted  re- 
searchers to  look  for  further  events  along  the  auditory  path- 
way from  the  ear  to  the  brain,  events  that  were  related  to  the 
reception  of  sounds.  Most  of  the  work  done  in  this  area  falls 
Into  either  the  domain  of  electrophysiology  or  the  domain  of 
what  might  be  called  experimental  anatomy. 

The  auditory  nerve  after  being  formed  by  the  Junction  of 
the  nerve  fibers  from  all  parts  of  the  cochlea,  passes  through 
a tunnel  In  the  temporal  bone  and  Joins  the  brain  stem.  From 
there,  fibers  run  past  a few  neural  relay  stations  to  the  top 
layer  of  the  brain  called  the  cortex.  Little  would  be  gained 
from  going  Into  detail  on  the  anatomy  of  these  pathways  or  on 
the  various  auditory  areas  In ‘the  cortex  as  shown  In  Fig.  AI.5. 
Let  us,  however,  stress  that  there  are  projections  of  pathways 
from  either  ear  to  each  of  the  cortical  hemispheres  and  that 
such  multiple  pathways  and  other  safety  factors  Insure  safe 
delivery  of  the  stimulus  message  to  the  higher  centers  of  the 
brain.  This  fact  Is  underlined  by  the  results  of  removal 
(ablations)  of  various  parts  of  the  auditory  cortex  9/.  Simple 
discriminatory  abilities  of  animals  were  tested,  and  were  shown 
to  be  remarkably  unaffected  even  when  rather  large  parts  of  the 
auditory  cortex  were  removed.  At  the  present  time  we  have 
little  explanation  for  these  somewhat  startling  facts.  All  we 
C8U1  say  Is  that  behavior  In  response  to  sound  Is  a complex 
process  and  that  such  behavior  Is  Impossible  In  the  absence  of 
the  most  peripheral  portions  of  the  auditory  system  (i.e.,  the 
ear).  Without  an  Input  mediated  by  the  sense  organ  there  Is 
nc  output.  However,  at  least  parts  of  the  central  nervous  sys- 
tem can  be  removed  In  animals  without  affecting  simple  kinds 
of  behavior. 
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APPROX. 

SCALE 


Figure  A1 . 3 

Diagrammatic  cross-section  of  a cochlear  canal.  The  cochlear 
partition  includes  several  fibrous  flexible  membranes,  and 
contains  the  hair  cells,  the  ultimate  end  organs  of  hearing. 
(After  Davis  ^ and  Rasmussen  J/) . 
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Figure  A1.4 

Displacement  amplitudes  along  the  cochlear  partition  for 
different  frequencies . The  stapes  was  driven  at  a constant 
amplitude  and  the  amplitude  of  vibration  of  the  cochlear 
partition  was  measured.  The  maximum  displacement  amplitude 
moves  toward  the  stapes  as  the  frequency  Is  Increased. 

(Prom  Bekesy  and  Rosenbllth  and  Bekesy  8/) . 
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Figure  A1.5 

The  afferent  acoustic  pathway  (based  on  the  cat). 
The  locations  of  important  synapses  are  Indicated 
In  capital  letters  on  the  right.  The  "order"  of 
each  neuron  Is  Indicated  by  a number.  "Second- 
order"  neurons  are  separated  by  at  least  one 
synapse  from  the  sense  organ. 
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Within  the  last  two  decades  many  experiments  involving 
electrical  responses  from  various  locations  of  the  auditory 
system  have  been  reported.  We  cannot  attempt  to  review  here 
the  hundreds  of  papers  that  have  been  written.  There  are, 
however,  a few  general  facts  that  have  some  bearing  on  the 
problems  with  which  we  are  concerned.  First  of  all,  we  can 
say  that  electrical  activity  starts  practically  Instantaneously 
In  the  auditory  nervous  system  after  the  sound  waves  Impinge 
upon  the  eardrum.  Typical  electrical  responses  are  shown  In 
Pig.  AI.6.  The  delay  to  the  first  neural  electrical  i^lgnal  Is 
of  the  order  of  lAOOO  of  a second.  The  successive  locations 
In  the  nervous  system  are  activated  several  milliseconds  later 
and  electrical  activity  at  the  auditory  cortex  starts  about 
l/lOO  of  a second  after  the  Initial  click  sound  has  been 
delivered  to  the  ear.  These  time  intervals  are  remarkable  In 
several  respects.  The  minimum  reaction  time  to  a sound  stimulus 
is  of  the  order  of  120  to  150  milliseconds  10/.  In  other  words, 
we  can  at  present  account  only  for  a small  fraction  of  the  time 
necessary  to  elicit  behavior  In  response  to  acoustic  stimuli 
by  what  might  be  called  transmission  delays  In  the  afferent 
auditory  nervous  system.  It  Is  clear  that  It  takes  time  for 
tlie  muscles  to  go  Into  operation  guid  It  must  certainly  take 
time  for  the  activity  at  the  auditory  cortex  to  be  translated 
Into  activity  at  the  so-called  motor  cortex.  The  fact  that  an 
animal  lifts  Its  right  paw  In  response  to  an  acoustic  stimulus 
Is  a complex  affair;  we  are  certainly  not  ready  to  give  a neural 
Interpretation  of  such  matters  as  the  comprehension  of  speech. 


Figure  AI.6 

Tracings  of  electrical  responses  to  acoustic  clicks  recorded 
from  various  locations  of  the  auditory  system  (see  Pig.  A1.5). 
The  records  have  been  taken  from  the  data  pool  of  the  PsychO- 
Acoustlc  Laboratory  at  Harvard  University;  they  were  taken 
during  the  period  from  19^8  to  1952. 

The  acoustic  clicks  are  produced  by  applying  electric  square 
pulses  of  0.1  millisecond  duration  to  good  quality  earphones. 

The  sound  Is  conducted  from  the  diaphragm  of  the  phone  to  the 
eardrum  of  the  anesthetized  animal  by  means  of  a small  plastic 
tube.  The  resonances  of  the  entire  system  are  sufflclehtly 
damped  In  order  that  the  acoustic  system  as  a unit  can  show  a 
reasonably  flat  coupler  response  over  a range  of  about  5000  cps. 
The  coupler  cavity  Is  of  the  order  of  one  cc  for  animals  like 
cats  and  monkeys . Under  these  circumstances  most  of  the  energy 
In  the  click  Is  delivered  to  the  animal  In  about  one  millisecond 
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Figure  A1.6  (continued) 

There  Is,  however,  good  reason  to  assume  that  the  animal's 
electrical  responses  are  mainly  a function  of  the  Initial 
step  In  the  acoustic  click. 

The  responses  at  locations  1 and  5 were  recorded  by  fine 
platinum  wire  electrodes;  those  at  locations  2,  3 and  4 were 
obtained  with  high  Impedance  micro-electrodes  (mlcro-plpettes 
wh&se  diameter  was  of  the  order  of  5 mlcra) . The  responses 
shown  are  the  earliest  deflections  that  follow  delivery  of 
the  stimulus . Various  experimenters  have  shown  that  response 
activity  continues  sometimes  for  appreciable  fractions  of  a 
second,  and  In  particular  that  response  to  the  second  of  a 
pair  of  clicks  will  be  affected  during  these  time  Intervals. 

The  absolute  size  of  any  of  the  recorded  deflections  depends 
on  many  variables  and  Is  as  such  not  of  particular  Interest. 

At  all  locations  the  size  of  the  deflections  In  general  In- 
creases as  the  Intensity  of  the  stimulus  Increases;  maximum 
deflections  are  usually  of  the  order  of  several  hundred  micro- 
volts . 

Responses  to  acoustic  clicks  have  been  recorded  In  cats, monkeys, 
guinea  pigs,  hamsters,  bats  and  pigeons  (with  the  exception  of 
location  1 the  above  traces  come  from  cats;  the  records  at  lo- 
cation 1 come  from  a pigeon).  In  these  animals  responses  from 
comparable  locations  exhibit.  In  general,  similar  patterns  and 
are  characterized  by  similar  delays,  provided  the  animal's  body 
temperature  Is  normal . Response  to  clicks  are  visually  detect- 
able at  all  locations  of  the  auditory  nervous  system  while 
responses  to  pure  tones  do  not  exhibit  this  property  except  at 
the  most  peripheral  locations.  This  statement  should  not  be 
Interpreted  to  mean  that  there  are  no  responses  to  pure  tones 
at  the  auditory  cortex,  but  only  that  they  are  not  easily 
detected  with  our  present  methods  of  recording.  Responses  to 
clicks  can  also  be  shown  to  be  sensitive  to  "masking"  by  noise 
and  exposure  to  Intense  sound  (see  Rosenbllth  ll/) . 

Note  how  the  response  traces  of  location  1 'Illustrate  the  way 
In  which  responses  to  condensation  clicks  differ  from  rarefac- 
tion clicks  (in  the  first  Instance  the  diaphragm  of  the  earphone 
starts  by  moving  Inward  while  In  the  case  of  the  rarefaction 
click  It  starts  by  moving  away  from  the  eardrum) . The  earliest 
deflectldn  at  this  location,  labeled  M,  Is  the  most  prominent 
component  of  the  so-called  cochlear  microphone . These  mlcro- 
phonlc  potentials  generated  In  the  Inner  ear  of  the  animal  are 
useful  monitors  of  the  acoustic  stimulus.  Their  name  refers 
to  what  was  believed  to  be  the  mlcrophonlc  action  of  the  cochlea 
In  contradistinction  to  the  electrical  activity  labeled  N that 
originates  In  neural  structure  situated  near  the  end  organ. 
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When  the  ear  of  an  animal  is  exposed  to  strong  sounds 
certain  parts  of  the  neuro-electric  response  to  a standard 
click  stimulus  which  follows  the  strong  exposure  are  depressed 
for  time  Intervals  that  depend  on  the  severity  of  the  exposure, 
l.e.,  on  the  sound  pressure  level  of  the  stimulus.  Its  spectrum 
and  the  time  during  which  the  stimulus  Is  on  (see  Fig.  A1.7). 
When  the  exposure  becomes  severe  enough,  this  depression  of  the 
response  becomes  Irreversible  and  Indications  appear  that  there 
may  have  been  structural  damage  Inflicted  upon  certain  parts  of 
the  cochlear  partition.  There  Is  a rather  vague  general  rela- 
tion which  Indicates  that  those  regions  of  the  cochlea  show 
degeneration  that  are  more  directly  Involved  In  the  reception 
of  high  frequency  sounds.  Certain  studies  on  aged  patients  at 
the  Johns  Hopkins  Hospital  have  resulted  In  similar  findings 
for  what  were  apparently  advanced  cases  of  presbycousls  12/. 
Studies  of  a similar  type  on  animals  are  helpful  though  It  Is 
not  really  safe  to  extrapolate  In  order  to  estimate  what  might 
happen  In  the  case  of  man.  For  there  Is  not  only  much  varia- 
bility In  man's  reaction  to  high  Intensity  sounds  but  It  Is 
also  by  no  means  certain  that  the  action  of  the  middle  ear 
muscles  In  animals  and  men  have  the  same  or  even  comparable 
effects.  There  are  also  many  Instances  In  which  animal's  re- 
sponses to  sound  have  been  Irreversibly  affected  by  exposure 
to  loud  sounds  without  the  Investigator  being  able  to  demon- 
strate definite  structural  damage  by  histological  techniques. 

Electro-physiological  studies  on  animals  concerning 
reversible  auditory  fatigue  (the  depression  In  response  to  a 
click)  confirm  what  we  know  already  from  humain  audiograms. 

Loss  of  function  In  one  ear  leaves  the  performance  of  the  other 
ear  untouched  13/. 

Contrary  to  popular  belief,  the  eardrum  seems  to  play  a 
relatively  small  role  In  damage  produced  In  the  auditory  me- 
chanism by  the  exposure  to  high  Intensity  sounds . We  know  from 
various  researches  that  a small  hole  In  the  drum  will  have 
little  effect  on  hearing  and  perhaps  more  Important,  the  drum 
Is  In  some  sense  a safety  device  which  will  only  go  Into  action 
for  explosions  that  last  a relatively  long  time,  l.e..  In  which 
the  low  frequency  components  are  prominent . It  Is  In  this 
latter  circumstance  that  the  drum  may  be  tom  and  damage  may 
even  be  caused  to  the  ossicles . This  loss  of  drum  and  perhaps 
of  part  of  the  ossicular  chain  reduces  the  amount  of  sound 
energy  transmitted  to  the  Inner  ear. 
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Figure  A1 .7 

Effect  of  exposure  to  loud  tones  on  the  response  to  clicks 
as  recorded  from  the  round  window.  The  top  line  shows  typical 
responses  to  weak  clicks  (about  15  db  above  human  threshold) . 
The  lower  pictures  are  for  various  time  Intervals  after  ex- 
posures to  tones  of  200  and  2000  cps  for  10  seconds  at  a 
sound  pressure  level  of  about  105  db.  (l)  Both  neural  com- 
ponents are  reduced  In  approximately  the  same  proportions; 

(2)  recovery  Is  quicker  after  exposure  to  the  200  cps  tone, 
and  the  first  neural  component  becomes  supernormal;  (3)  the 
latency  of  N Is  Increased  In  the  early  stages  of  recovery 
after  the  2000  cps  exposure . (Prom  Rosenbllth  11/) . 
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APPENDIX  2 


Properties  of  Ear  Protectors  or  Aural  Protective  Devices 

There  are  many  Industrial  and  military  situations  In 
which  personnel  must  be  present  In  a hlgh-lntenslty  sound 
field  In  order  to  perform  their  tasks  properly.  It  may  not 
be  practical  or  even  possible  to  reduce  the  sound  pressure 
level  by  control  of  the  noise  at  the  source  or  by  protecting 
the  Individual  by  a shield  or  enclosure . Under  these  circum- 
stances, It  Is  possible  to  reduce  the  amount  of  sound  energy 
reaching  the  ears  of  the  Individual  by  the  use  of  ear  pro- 
tectors . During  World  War  II  Intensive  research  was  done  In 
this  area  (see  Shaw,  Veneklasen,  Stevens  et  al  3^) . 

By  far  the  major  portion  of  the  sound  energy  that  reaches 
the  inner  ear  Is  transmitted  via  the  ear  canal,  the  eardrum 
and  the  ossicular  chain.  The  usual  types  of  ear  protectors 
are  devised  to  minimize  the  amount  of  sound  energy  transmitted 
by  this  path.  This  path  Is  usually  called  the  air  conduction 
path.  In  contrast  to  the  bone  conduction*  path  through  the 
bones  of  the  head. 

Even  though  transmission  by  air  conduction  may  be  reduced 
to  a negligible  value,  sound  energy  may  still  be  transmitted 
through  the  bones  of  the  head  to  the  cochlea.  It  has  been 
shown  ^ that  for  persons  with  normal  hearing  at  certain  fre- 
quencies In  the  middle  range  bone -conducted  energy  Is  at  least 
50  db  below  the  energy  transmitted  by  air  conduction.  Thus 
there  Is  an  upper  limit  to  the  effectiveness  of  ear  protectors 
that  are  designed  to  reduce  the  air-borne  sound  energy.  Such 
protective  devices  cannot  be  expected  to  decrease  the  sound 
energy  reaching  the  Inner  ear  by  more  than  about  50  db,  when 
the  head  of  the  Individual  Is  In  the  sound  field.  Actually, 
other  considerations  seem  to  limit  the  effective  transmission 
loss  to  something  less  than  50  db . 

There  are  three  general  methods  of  occluding  the  ear  with 
devices  that  attenuate  air-borne  noise;  (l)  substances  In- 
serted Into  the  ear  canal,  so-called  "insert  types,"  (2)  objects 
covering  at  least  the  entrance  to  the  ear  canal,  and  often  the 
entire  outer  ear,  so-called  "cushion"  or  "doughnut  types," 


* The  most  Important  sources  of  Information  on  bone 
conduction  are  Barany’s  monograph  ^ and  several 
articles  by  Bekesy 
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(3)  fitted  coverings  for  the  major  or  entire  area  of  the 
head,  so-called  "helmet  types." 

Insert  types  Include  malleable  substances  that  can  be 
molded  by  the  wearer  to  fit  his  own  ear;  formed  soft  rubber 
plugs  either  vented  or  unvented;  hard  plastic  substances 
preformed  to  approximate  average  canal  dimensions  and  contours; 
permanent  molds  of  Individual  canals  made  from  casts;  metal 
and  soft  rubber  plugs  with  valves  designed  to  close  only  at 
high  sound  pressure  levels;  formed  soft  rubber  plugs  with 
metallic  weights  which  Increase  the  mass. 

Cushion  or  doughnut  types  Include  kapok-fllled  toroids, 
such  as  the  familiar  "doughnut  type"  earphone  socket  used  In 
flying  helmets;  round  rubber  enclosures  containing  perforated 
discs  which  help  to  discriminate  against  certain  portions  of 
the  audible  frequency  range;  ovoid  sponge  rubber  sockets  mounted 
In  a plastic  shell;  round  rubber  sockets,  lined  with  metal  and 
acoustic  absorbent  substances.  All  of  these  devices  are  kept 
In  place  by  headbands . 

Helmet  types  Include  flying  helmets  with  earphone 
cushions,  and  crash  helmets.  Some  effort  Is  at  present  being 
devoted  to  producing  a helmet  with  high  noise -attenuation  pro- 
perties. Such  a helmet  must  enclose  the  entire  head.  Including* 
the  face.  In  order  to  reduce  the  sound  energy  transmitted  to 
the  bones  of  the  head,  as  well  as  the  energy  transmitted 
through  the  normal  air  conduction  path. 

It  Is  clear  that  both  helmet  and  doughnut  type  ear 
protectors  can  be  combined  with  the  Insert  type. 

Provided  personnel  have  been  trained  to  accept  and  wear 
ear  protection,  the  most  Important  criterion*  by  means  of  which 
the  effectiveness  of  such  devices  Is  measured  Is  the  reduction 
of  sound  energy  (usually  expressed  In  decibels)  that  Is  pro- 
vided. The  reduction  must  be  specified  as  a function  of 
frequency,  since  It  Is,  In  general,  different  for  diverse 
frequencies . 


Other  criteria,  to  be  considered  later.  Include 
such  considerations  as  comfort,  ease  of  Insertion 
and  removal,  and  freedom  from  the  possibility  of 
Infection. 
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There  are  several  methods  of  measuring  the  performance 
of  ear  protectors.  Some  methods  involve  testing  by  physical 
means  only,  whereas  others  involve  psycho-acoustic  measure- 
ments. Each  method  is  discussed  briefly  below. 

1.  In  the  physical  procedure,  a small  enclosure  or 
coupler  containing  an  aperture  is  used  to  test  the  ear  pro- 
tector. If  an  Insert  type  is  to  be  tested,  it  is  Inserted 
into  the  aperture j if  a cushion  type  is  to  be  tested,  it  is 
mounted  over  the  aperture . A loudspeaker  or  other  sound 
generator  is  mounted  outside  the  enclosure,  and  a microphone 
is  used  to  measure  the  sound  pressure  Inside  the  enclosure. 

The  loud speaker -microphone  system  is  first  calibrated  with 
the  aperture  open.  Then  the  ear  protector  is  placed  in  or 
over  the  aperture,  and  the  reduction  is  measured.  There  is, 
of  course,  an  upper  limit  to  the  measurable  transmission  loss, 
determined  by  the  transmission  loss  of  the  walls  of  the  enclo- 
sure and  microphone . 

The  physical  procedure  does  not  take  into  account  the 
variables  encountered  when  protectors  are  placed  in  or  over 
human  ears.  Ears  vary  considerably  in  size  and  shape,  and 
hence,  there  are  considerable  differences  in  the  fit  of  the 
same  devices  used  in  different  ears . Thus  there  is  some 
variation  in  effective  transmission  loss  in  practice.  However, 
the  physical  measurement  procedure  should  provide  a reasonably 
reproducible  technique  for  the  evaluation  of  the  relative 
attenuation  provided  by  various  types  of  protectors  - a tech- 
nique that  could  be  standardized.  For  some  applications, 
therefore,  the  procedure  is  useful,  though  problems  of  fit 
in  the  ear  canal  and  static  pressure  on  the  ear  drum  have  to 
be  considered  when  physical  and  psycho-acoustic  measurements 
are  compared . 

2.  Two  psycho-acoustic  procedures  have  been  used  in 
order  to  measure  the  protection  provided  by  the  above  mentioned 
devices.  The  usual  method  is  to  measure  the  subject’s  absolute 
threshold  of  hearing  when  he  is  wearing  protectors  and  when  he 
is  not.  The  difference  is  the  attenuation  afforded  by  the  de- 
vice. More  realistic  data  are  obtained  if  free-fleld  measure- 
ments are  made,  but  reasonable  results  for  plugs  (but  not  for 
cushion  type  defenders)  may  be  obtained  if  the  signal  is 
presented  through  earphones . 

A second  procedure  that  is  occasionally  used  is  a loud- 
ness balance  technique . A tone  is  presented  to  a listener 
first  without  ear  protectors  and  then  with  protectors.  The 
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Intensity  of  the  tone  In  the  second  case  Is  raised  until  the 
subject  Judges  It  to  be  as  loud  as  the  tone  he  heard  without 
ear  protection.  The  Increase  In  Intensity  at  any  given  fre- 
quency Is  equal  to  the  attenuation  provided  by  the  device . 

In  practice  the  usual  procedure  Is  to  present  the  tone  through 
earphones . The  plug  Is  placed  In  one  ear  only,  and  the  rela- 
tive signal  levels  at  the  two  earphones  are  adjusted  until 
the  subject  Judges  the  tones  at  the  two  ears  to  be  equally 
loud.  The  difference  In  signal  level  at  the  two  ears  (assum- 
ing Identical  performance  of  the  two  ears,  and  that  Is  an 
assumption  that  ought  at  least  to  be  checked)  Is  the  attenua- 
tion of  the  plug.  Such  binaural  loudness  balance  techniques 
are  not  without  pitfalls  (see  Garner  . 

Figure  A2.1  shows  an  experimental  comparison  of  the  thres- 
hold shift  and  loudness  balance  techniques  for  one  type  of  ear 
protector. 

Miller,  Weiner  and  Stevens  7/  have  summarized  measure- 
ments of  threshold  shifts  for  various  types  of  aural  protective 
devices,  Some  of  their  data  are  shown  In  Pig.  A2.2,  plotted 
In  the  stamdard  form  of  audiograms . Most  of  the  devices  give 
an  attenuation  of  15-25  db  at  100  cps . The  attenuation 
gradually  Increases  to  a maximum  of  30-40  db  at  high  frequencies. 
The  perfection  of  fit  of  the  plug  has  a marked  effect  upon  the 
attenuation.  The  data  of  Pig.  A2.2  represents  the  probable 
maximum  attenuation  obtainable  with  well -fitting  plugs  of  the 
types  Indicated. 

The  attenuation  for  well-fitting  cushion  type  defenders 
Is  the  same  order  of  magnitude  as  that  of  plugs.  However,  If 
plugs  are  used  In  conjunction  with  cushion  type  defenders,  a 
higher  attenuation  can  be  obtained.  It  Is  believed  that  this 
combined  protection  approaches  the  theoretical  limit  Imposed 
by  bone -conducted  sound.  Greater  attenuation  can  be  obtained 
only  by  completely  enclosing  the  head  with  a helmet  constructed 
from  a material  that  provides  adequate  transmission  loss. 


Figure  A2.1 

A comparison  of  the  attenuation  for  one  type  of 
ear  protector  measured  by  the  threshold  shift 

and  loudness  balance  technique  (Prom  Miller,  ■» 

Wiener  and  Stevens  J/) . 
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The  use  of  defenders  effectively  gives  to  the  wearer  a 
temporary  conductive  type  hearing  loss . This  is  illustrated 
In  Fig.  A2.3i  which  Indicates  a subject's  threshold  with  and 
without  a protective  device . The  audiograms  were  obtained 
by  means  of  a continuous  recording  audiometer.  The  remarks 
In  Sec.  16.3  on  conductive  deafness  will  apply,  therefore, 
to  the  situation  In  which  protective  devices  are  used.  In 
particular,  the  comments  on  speech  perception  are  pertinent. 

The  effect  of  wearing  plugs  on  the  Intelligibility  of 
speech  In  noise  has  been  tested  experimentally  8, 9 A and  the 
results  are  presented  In  Pig.  A2.4.  The  intelligibility  of 
faint  speech  sounds  In  the  quiet  Is  greatly  reduced  by  the 
plugs.  For  very  Intense  speech,  certain  experimenters  have 
shown  a drop  in  articulation  score  In  the  absence  of  noise . 

No  such  decrease  was  observed  with  plugs  8/.  As  the  level  of 
the  masking  noise  Is  Increased  to  a point  where  the  effective 
masking  of  the  speech  Is  60  db  or  more,  ear  protectors  do  not 
interfere  with  speech  reception.  In  fact,  the  data  show  that 
for  high  noise  levels,  the  wearing  of  earplugs  actually 
Improves  the  Intelligibility  of  speech  In  noise . This  could 
be  attributed  to  the  fact  that  the  plugs  do  not  alter  the 
signal -to-nolse  ratio,  but  merely  lower  the  effective  level 
of  both  the  speech  and  noise  by  about  20  db.  This  fact  Is 
of  some  practical  Importance,  since  It  Indicates  that,  besides 
providing  protection  against  high-lntenslty  noise,  ear  pro- 
tectors Improve  the  ability  of  the  wearers  to  receive  instruc- 
tions or  orders  In  the  presence  of  noise . 

In  a recent  report  on  Problems  of  High  Intensity  Noise  10/ 
several  practical  considerations  that  have  Interfered  with  the 
general  use  of  ear  protection  are  examined.  The  following  dis- 
cussion reproduces  some  of  the  practical  points  raised  by 
Rosenbllth,  Wheeler  and  Smedal . 


Figure  A2.2 

Attenuation  In  decibels  provided  by  various  types 
of  ear  protectors  (insert  type).  The  attenuation 
is  determined  by  measuring  the  threshold  shift 
that  occurs  when  the  ear  Is  plugged.  (After  Miller, 
Wiener  and  Stevens  lJ , and  Kryter  2/) • 
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1.  size . Among  men,  ear  canals  differ  considerably 

In  cross-sectional  area  'ij , They  differ  also  In  the  direction 
and  tortuosity  of  their  course  leading  to  the  tympanic  cavity. 

As  a result,  the  problem  of  "fitting"  the  Individual  Is  the 
first  problem  encountered  In  promoting  the  use  of  ear  protec- 
tion. The  Individually  molded  substances  offer  no  difficulty 
here.  The  soft  rubber  forms  are  usually  supplied  In  several 
sizes,  from  which  the  Individual  Is  expected  to  select  the 
size .that  fits  his  ear  canals  best.  Guidance  should  be  given 
at  this  point  because  men  frequently  choose,  for  reasons  of 
comfort,  sizes  smaller  than  correct.  A protector  that  Is  too 
small  loses  much  of  Its  value  because  even  slight  leaks  de- 
crease the  protection. 

2.  Comfort . Ear  protectors  are  not  necessarily  uncomfort- 
able, but  the  fact  should  be  faced  frankly  that,  with  many  men, 
the  first  complaint  Is  likely  to  be  that  of  discomfort.  Often, 
however,  this  stems  from  a desire  to  avoid  using  the  protection 
rather  than  from  true  physical  discomfort  or  pain.  In  the 
matter  of  comfort,  the  best  procedure  seems  to  be  that  of  hav- 
ing a man  choose  the  type  of  protector,  as  well  as  the  size 

he  prefers.  It  Is  often  good  policy  to  have  several  types  of 
protectors  available  rather  than  to  fix  on  one  which  Is 
supposedly  "the  best."  Among  the  several  protectors  that  are 
acceptable,  there  Is  relatively  little  difference  In  the 
attenuation  they  yield.  These  differences  are  minor  compared 
to  the  difference  between  protection  and  no  protection. 

3.  Distribution.  The  best  hearing-conservation  programs 
will  distribute  protectors  under  the  supervision  of  persons 
who  have  both  Interest  and  experience  In  the  use  of  ear  protec- 
tion. In  the  Services,  this  person  might  typically  be  a 
medical  officer,  a nurse,  or  even  a hospital  corpsman.  The 
Indiscriminate  offering  of  protectors  usually  leads  to  a high 


Figure  A 2. 3 

A subject's  thresholds  with  and  without  a pro- 
tective device  are  shown  by  two  curves . The 
audiograms  were  obtained  by  means  of  a continuous 
recording  audiometer.  The  difference  between  the 
two  curves  at  any  one  frequency  gives  the  loss 
provided  by  the  plug  at  that  frequency.  (From  an 
unpublished  paper  by  D.  E.  Wheeler,  personal 
communication) . 
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number  of  rejections  unless  some  guidance  Is  offered;  demon- 
strations of  how  they  are  Inserted  and  removed,  of  how  to 
select  the  best  size  for  each  man,  some  discussion  of  what 
protectors  will  and  will  not  do,  all  contribute  to  a man's 
acceptance  of  ear  protection.  Because  certain  diseases  of 
the  ear  are  encountered  that  make  the  use  of  Insert  type 
protectors  undesirable.  Inspection  of  the  canals  ought  to  be 
carried  out  by  someone  competent  to  recognize  possible  compli- 
cations. Ordinarily,  this  Is  a matter  for  medical  supervision. 

4.  Care  and  upkeep.  Most  protectors  are  not  fragile, 
nor  do  they  deteriorate  rapidly  with  usage . The  manufacturer 
usually  supplies  a convenient  carrying  case  that  also  prevents 
gross  contamination  when  the  protectors  are  not  In  use.  The 
permanent  types  are  washable . Regular  cleaning  of  the  pro- 
tectors adds  to  their  useful  life,  and  cleaning  Is,  of  course, 
an  obvious  requirement  from  the  standpoint  of  hygiene . A 
contaminated  Insert  type,  or  one  that  has  an  accumulation  of 
hardened  wax,  presents  a rough  surface  to  the  lining  of  the 
ear  canal . Protectors  left  In  such  a condition  are  unnecessarily 
uncomfortable  and.  If  repeatedly  used,  might  very  well  contribute 
to  an  external  otitis.  For  these  reasons,  medical  supervision 
Is  Indicated  In  the  fitting  of  ear  protection  and  In  providing 
the  necessary  Instruction  and  Indoctrination. 

When  flight  personnel  wear  plugs,  they  should  be  cautioned 
of  the  possible  effects  of  a rapid  decrease  In  altitude*.  The 
resulting  unequal  pressure  on  the  two  sides  of  the  plug  may 
tend  to  push  the  plug  Into  the  ear  canal,  with  possible  damage 
to  the  canal  and  to  the  tympanic  membrane . The  flyers  should 
be  cautioned  to  loosen  the  plugs  before  descending,  or  to  wear 
plugs  that  have  a small  hole  for  purposes  of  pressure  equaliza- 
tion. 


Personal  communication  from  Major  Elizabeth  Guild, 
USAP. 


Figure  A2.4 

The  relation  between  the  percentage  word  articula- 
tion and  the  speech  level  with  and  without  earplugs . 
The  parameter  Is  the  level  of  the  masking  noise . 

NDRC  "type  V-51R"  earplugs  were  used.  The  data  show 
an  Improved  Intelligibility  In  the  presence  of  Intense 
noise . (From  Kryter  8/) . 
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